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DRAVIDIAN SUBSTRATUM AND 
INDO-ARYAN LANGUAGES 


PANCHANAN MOHANTY 
University of Hyderabad 


1. Introduction 


It is the possession of language and culture that makes human 
beings unique as well as very different from other animals. In fact, the 
development of both language and culture has not been understood 
fully even now. When it is quite possible to train a pet such that it 
acquires some cultural traits of human beings, it cannot pass them on to 
its progeny. On the contrary, humans do hand over their language and 
culture to their descendants. So scholars hold the view that there is a 
close relationship between language and culture, and they have 
developed in correlation with each other. Kroeber (1948: 225) states: 
"Culture, then, began when speech was present, and from then on, the 
enrichment of either meant the further development of the other." That 
is why, language has been said to be an index of culture. Language and 
culture are products of the human brain. Both required a more 
developed brain so that they could expand. Due to migration of human 
beings from one place to another, various linguistic and cultural groups 
came in contact with each other and, as a result, exercised mutual 
influence. India bears probably the most significant testimony to this 
kind of contact and that is why it is truly a multilingual, multicultural, 
multiethnic country. There have been a number of studies in this area 
since the epoch making publication of Emeneau (1956). In this 
presentation, I wish to examine more data from the four major language 
families of this country in general and Indo-Aryan and Dravidian in 
particular to show the depth and range of their convergence. 


* Presidential Address delivered at the 35th All India Conference of Dravidian 
Linguists held at Mysore University, Mysore from 21st to 23rd June 2007. 


ijdl Vol. XXXVII No. 1 2 Mohanty 
2. The Linguistic Situation in India 


It is more or less accepted that out of the four major language 
families found in India, the Dravidian and Indo-Aryan speakers have 
entered from the north-western side whereas the Munda speakers have 
entered from the north-eastern side. This is the reason for which we see 
a lot of shared characteristics between Dravidian and Indo-Aryan on 
the one hand and Munda and Tibeto-Burman on the other even in the 
pre-historic times. 


India has been a linguistic area because there was contact 
leading to convergence among various linguistic stocks at different 
periods. It must be made very clear here that contact does not straight 
away lead to convergence. Contact between two linguistic groups 
facilitates bilingualism. As both the speech communities become 
bilingual, they codemix and codeswitch quite frequently and also start 
using the other language structure while speaking their mother 
languages. This is what is called substratum influence. A careful 
analysis of the Neo Indo-Aryan (NIA) languages reveals that there is a 
very strong Dravidian substratum in this country. | 


3. Language as a Tool to Reconstruct the Cultural Past 


It is now widely accepted that language is a repository of 
information about the cultural past. Seton-Watson (1981:2) states: "The 
history of language is not just a subject for philologists; but forms a very 
important part of social history, and one which seems to be relatively 
neglected by most historians." Burke (1987:17) also expresses a similar 
view: "Linguistic forms, their variations and changes, also tell us some 
thing about the quality of social relationships in a given culture, or 
cluster of cultures." and "..language is too important historically to 
leave to the linguists - so intimately involved with the process of social 
interaction and social change that social historians need to give it much 
more attention than they have done so far;...". But use of language as a 
tool in reconstructing social history is not popular and linguistic 
orientation in the study of social history is yet to take off in India. It can 
easily be demonstrated that linguistic clues are significant in the 
reconstruction of social history. Let us take a couple of examples from 
the first major Oriya poet Sarala Dasa’s SaraLa: Maha:bha:rata which 
was written early in the 15th century. There are Dravidian words like 
/mudusuli/ ‘an old maid in a palace’, /meka:pa/ ‘bodyguard’, /kuTTuNi/ 
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‘a woman messenger’ in it. Not only these, the sources of Oriya 
surnames like Lenka, Malia and Muduli are, in fact, Dravidian. The 
first one means ‘servant’ and the remaining two, ‘headman’. Then, there 
is an extremely popular proverb in Oriya, i.e. /a:paNa: suna: bheNDi/ 
‘Own gold is a useless thing.’ It is normally used in a situation like the 
following: When a son does not look after the parents though it is the 
norm in our society. The parents blame the son by saying that he is not 
gold; he is silver. In other words, he is not an ideal son, but a bad one. 
Oriya dictionaries give two meanings for /bheNDi/: One is the vegetable 
‘lady’s finger’ and the other is ‘a useless thing.’ Interestingly, this 
proverb is the only example where /bheNDi/ occurs in the second 
meaning. Also, it is difficult to accept ‘gold’ to become ‘a useless thing’. 
Actually, it is the Dravidian word /weNDi/ ‘silver’ that has become 
/bheNDi/ in Oriya pronunciation. It also perfectly fits in the context of 
the meaning of the proverb because the son is not precious like gold, he 
is a cheaper metal like silver. There is another proverb /ka:Li kotari, 
hesa muturi/. Interestingly, many Oriya dictionaries (e.g. Gopal 
Chandra Praharaja’s purNacandra oDia: bha:sakosa) have not even 
listed /kotari/ in them. However, it is certainly related to the Telugu 
word /ku:turu/ (1873) ‘girl/daughter’. The meaning of this proverb is: 
‘The girl is dark-complexioned and wets the mat (while sleeping).’ So 
the implication is that she is not a good match for marriage. 


Again, there are many place-names in Central India which are 
Dravidian, e.g. Chilika, the famous natural lake of Orissa, means ‘a 
river’ or ‘a big stretch of water’ in Dravidian (DEDR 2569). A river 
named /ka:ThajoDi/ flows on the southern side of Cuttack town. 
Though I am not sure of the source of /ka:Tha/, there is little doubt that 
/joDi/ is nothing but Tamil /co-r/, Kannada /juri/ (DEDR 2883). The 
same /joDi/ is also found in /naijoDi/ ‘a low lying narrow stretch on the 
road-side through which water flows.’ Besides, there are a number of 
places with the Dravidian suffix /-peT/, /-peTa:/ (e.g. court-peTa), /-koT/ 
(e.g. /khalikoT/, /dhara:koT/) and /-Diha/ (e.g. /ganjeiDiha/) in Orissa. 
/Diha/ is also an Oriya word meaning a piece of home-stead land. I 
should mention here that Rajkot in Gujarat, Faridkot in Punjab, and 
Giridih in Jharkhand also possess the above mentioned suffixes. All 
these examples provide evidence in support of a strong Dravidian 
substratum not only in the Orissa, but also in Gujarat, Punjab and 
_Jharkhand/Bihar. My contention here is that linguistic analysis has to be 
built into the system of socio-historical reconstruction of a society. 


*- 
FP: 
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4. Sanskrit vis-a-vis Neo Indo-Aryan 


We are aware of the wide-ranging influence of Sanskrit on 
different Indian languages. But what is not much discussed is the 
influence of other Indian languages which converted Sanskrit, an 
Indo-European language, into Prakrits within five hundred years of its 
arrival on the Indian soil and that this change started from Central 
India, the meeting point of Dravidian and Aryan languages and 
cultures. Let us call this phenomenon Indianisation or nativisation of 
Sanskrit and take the example of consonant cluster simplification to 
illustrate it. It should be pointed out here that consonant clusters are a 
characteristic feature of Sanskrit whereas lack of such clusters is a 
significant aspect of the genius of the Dravidian languages. The 
consonant clusters found in modern Dravidian languages are mainly 
due to borrowing from Sanskrit. Interestingly, Prakrits, the next stage of 
Sanskrit, do not possess these clusters. In the initial stage of the 
Prakrits, these clusters were geminated and at a later stage they got 
simplified by way of deletion of the weaker consonant and lengthening 
of the preceding vowel in every word. If we compare the NIA languages 
like Punjabi and Hindi with Oriya and Bengali a clear trend will 
emerge. Consider the following examples: 


Punjabi Hindi Oriya Bengali Gloss 
maTTi: maTTi: ma:Ti ma:Ti ‘clay 
haDDaji: haDDi: ha:Ra ha:R ‘bone’ 
ajj aj afi aj ‘today’ 
kaTT ka:T ka:T ka:T ‘to cut’ 
kann ka:n ka:na ka:n ‘ear’ 
dand da.t da:ta da:t ‘tooth’ 
hatth ha:th ha:ta ha:t ‘hand’ 
panj pac pa:ca pac ‘five’ 
aTTh a:Th a:Tha a:Th ‘eight’ 
pra: bhay bhasi bha.y ‘brother’ 
piTTh pi:Th piThi pith ‘back’ 
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Notice that in the first two examples, Punjabi and Hindi have 
geminates and in the rest only Punjabi preserves geminates; but these 
geminates have been simplified in Oriya and Bengali. The point I am 
trying to make here is that in respect of the above characteristic, 
Punjabi is the closest to Sanskrit, then comes Hindi as it retains some of 
them; and Oriya and Bengali are the farthest from Sanskrit because 
they have simplified the geminates. 


_ Let us take another example from culture. The Vedic Aryans 
used to worship gods by way of /homa/ in which they offered flesh or 
ghee to fire. The ritual is surely indicative of a cold climate where fire is 
bound to become sacrosanct and people are forced to eat meat due to 
the scarcity of vegetation. But in contrast the present day Indians 
usually do /pu.ja:/ instead of /homa/, and flowers, leaves, food grains, 
and fruits are the essential ingredients in it. Firstly, it must be noted that 
/pu:ja:/ was completely unknown to the Vedic Aryans as well as their 
Indo-European counterparts whereas it was very much in vogue among 
the Dravidians. Secondly, the word /pu.ja:/ does not have cognates in 
other Indo-European languages like Latin, Greek, Hittite, etc. Thirdly, 
there is little doubt that both /pu:ja:/ ‘worship’ and /puSpa/ ‘flower’ of 
Sanskrit are related to the Dravidian root /pu:-/ ‘flower’ (DEDR 4345) 
and are later entrants into this language. On the basis of all these, it can 
be argued that Indianisation or nativisation of the Sanskrit language as 
well as culture had started from Central India, and the driving force 
behind it was Dravidian influence. 


5. Some Examples of Deep Convergence 
5.1. The custom of /ma:rkuNDi/ and /kuilibara/ 


In the typical Aryan India, i.e. in the states of Jammu and 
Kashmir, Punjab, Haryana, Himachal Pradesh and Uttar Pradesh as 
well as in the four Dravidian-speaking states, i.e. Tamil Nadu, Kerala, 
Karnataka and Andhra Pradesh, there exists a normal relationship 
between married women and their husbands’ elder brothers. On the 
contrary, a relationship of avoidance is observed between them in the 
Munda and Tibeto-Burman speaking communities. It is also customary 
for a younger brother to marry his elder brother’s wife after the latter’s 
death in the communities belonging to both the Munda and 
Tibeto-Burman stocks. In the anthropological jargon this system is 
called ‘levirate’. However, I want to mention in this context that 
similarly no married woman is allowed to talk with, or even look at, her 
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husband’s elder brother and vice-versa in the Oriya society. A similar 
custom is prevalent in Bengal, Bihar and Jharkhand also. Marriage of a 
person’s wife with his younger brother is found taking place 
occasionally on the 12th day of his death only in the lower castes in 
Orissa. But both a man’s brother and his wife enjoy a very close and 
joking relationship in the whole Oriya society. The same is found in 
other parts of India also. There is another important thing to be 
mentioned here. When an Oriya bride-groom goes in a procession to 
marry and bring home the bride, normally one of his younger brothers, 
dressed like a groom and called /ma:rkuNDi/ in Standard Oriya, 
accompanies him. Though it is termed a native word by the Oriya 
lexicographers, it is actually a Dravidian compound consisting of /ma-r/ 
(4834) and /kuNDi/ (1173) meaning ‘substitute husband’. In northern 
Orissa, /kuili bara/ is used instead of /ma:rkuNDi/. Here the source of 
/bara/ is Indo-Aryan and it means ‘bridegroom’. Though /kuili/ 
(Standard Oriya /koili/) is normally accepted as the singing bird /kotli/ 
‘cuckoo’, this is a chance similarity and they have no connection at all. It 
is undoubtedly related to Dravidian /kowil/ ‘temple’. This argument is 
evidenced by the fact that /kuili buDa/ (Standard Oriya /koili buDa/) 
refers to the bride’s morning bath after which she goes to a temple. So 
this expression will have a bizarre meaning if /kuili/ is understood as 
‘cuckoo’ whereas it will be quite appropriate, i.e. ‘temple bath’ if we link 
~ [kuili/ to Dravidian /kowil/ ‘temple’. This interpretation of /kuili/ also 
gives us a striking similarity between /ma:rkuNDi/ and /kuilibara/. In all 
probability, this custom of /ma:rkuNDi/ accompanying the groom is 
meant for everybody involved in the marriage to know that there is 
somebody to step into the shoes of the real bride-groom in the event of 
the latter’s demise. A lot of similarities can be seen between the custom 
prevalent in the Munda and Tibeto-Burman societies and that of the 
Oriyas. So it is evident that the prevalence of this custom in the Oriya 
society is due to its convergence with the surrounding Munda 
communities, but it is expressed in and through Dravidian words. 


5.2. /nita:ni/: an uncommon Oriya word for ‘milk’ 


Studies on the Tibeto-Burman societies reveal that they never 
use milk; rather they abhor it. Regarding the Mishmis, Dalton (1978:15) 
states: "They do not use them (cattle) for agricultural purposes or for 
their milk." The Hill-Miris, according to him, also do not touch milk. 
(ibid.: 31). Again, with reference to the Garos he writes "... they have no 
aversion to any food, except milk, which they abominate..." (ibid.: 62). 
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The very same trait is found among the Mundas also. The following 
_ words from Dalton (1978:195) validate this observation: "The Kols 
plough with cows as well as oxen, but it is to be recollected that they 
make no other use oi the animal as they never touch milk." Frazer 
(1986:293) has also reported that this practice is in vogue among some 
hill tribes, especially the Hos: "The cattle are used only for ploughing 
for the Hos, like many other hill tribes of India they never touch milk." 
In other words, for both the Munda as well as Tibeto-Burman peoples, 
milk is forbidden. This aspect of Munda culture is reflected in the 
language as well. Most Munda languages possess phonetically 
unrelated words for milk. For example, it is /Da:ktar ~ Da:?tar/ in 
Bonda, /toa/ in Santali, /a:duban/ in Sora, /Dra:/ in Didayi, /nunu/ in 
Kharia, and /dud ~ Dud ~ DiDom/ in Korku, etc. Notice that though 
there are lots of variations with reference to the word for milk, every 
Munda language uses either /da:/ or one of its variants for water. I want 
to argue here that the word for ‘water’ has been inherited by the Munda 
languages from their parent whereas the words for milk have been 
either coined or acquired later from other sources. 


Data from various languages spoken all over the world shows 
that it is an established norm to use an equivalent descriptive and 
circumlocutory expression for a forbidden or taboo object. This is the 
reason Sanskrit, Greek, and Latin, though daughters of the same 
Proto-Indo-European parent, use three different words, 1e. /candra/ 
literally ‘shining’, peyy’api /fengan/ (Modern Greek) ‘that which shines’, 
and /lu:na/ ‘that which glitters’, respectively, for the moon. Similarly in 
rural Orissa, people often use /ha:Ria:Ni/ ‘scavengress’ for the house 
lizard and /lambajantu/ ‘long animal’ for the snake, especially at night. 


It is interesting to note that all the four major Dravidian 
languages, 1.e. Tamil, Telugu, Kannada and Malayalam have only one 
word (i.e. pa: ~ pa.lu ~ ha:lu) for ‘milk’. But if we count the words for 
‘milk’ in Oriya we will find at least six, e.g. /gurasa/, /naLa:pa:Ni/, 
/okhara/, and /nitia:ni/, besides the commonly used /khira/ and /dudha/. 
Again, there is only one word /dahi/ for ‘curds’, and another, /ghia/ for 
‘ghee’, used all over the Oriya-speaking tracts. My contention is that 
‘milk’ has become a taboo in Oriya under the Munda influence, and 
that is why there are so many circumlocutory expressions to refer to it. 
/gurasa/ is derived by adding /rasa/ ‘juice’ to /go/ ‘cow’ and it literally 
means ‘cow’s juice’. /naLa:pa:Ni/, which apparently seems strange and 
unintelligible, is actually a descriptive expression. It is a blend of the 
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Dravidian word /nara:/ ‘white’ and Ortya /pa:Ni/ ‘water’. In other words, 
it means ‘white water’. I should mention here that as there is no word 
like /nara:/ in Oriya, and the Oriya speakers have changed it to an 
existing and familiar Oriya word /naLa./ ‘drain’ without realizing the 
significance of the former. Of course, each language undergoes many 
such changes in its growth and development. Then /okhara/ is also a 
descriptive expression consisting of two Dravidian words: /o/ 
‘appropriate, equal, fit’ and /kara/ ‘sap issuing from trees, gum, juice of 
fruit, etc.’ This /kara/ has become /khara] due to stress on the 
non-initial syllable as it has happened in the case /sakhaLa] (/saka:La/) 
‘morning’ or /majbhut] (/majbut/) ‘strong’. The literal meaning of 
/okhara/ is ‘appropriate, equal or fit (i.e. drinkable) sap or juice’. Thus, 
all these words are circumlocutory expressions for milk. Let us now 
consider /nitia-ni/. 


Though the Savara (Munda) god Jagannatha had been there 
since time immemorial, he gained importance only after the Dravidian 
Gangas had ascended the throne of Orissa. The Somavamshis, the 
Bhanjas and the Nandas, who occupied the throne of Orissa before the 
Gangas, were worshippers of Siva. So it is obvious that Jagannatha did 
not gain much importance during their rule. The present gigantic and 
architecturally marvellous Jagannatha temple of Puri was in fact built 
by Anantavarmana Chodaganga Deva (1078-1150), founder of the 
Ganga dynasty in Orissa. The devada:si: (i.e. God’s maid-servant) 
dance ritual in the temple was also introduced by the Gangas. Thus, it 
can be claimed that the process of Dravidianization of Lord Jagannatha 
and his culture started with the Ganga rule, though Dravidian influence 
on the language and culture of Orissa certainly existed before that. The 
presence of an enormous number of Dravidian words in the temple 
register bears testimony to this claim. For example, /lenka:/ ‘servant’, 
/muduli/ ‘chieP, /kuRua:/ ‘earthen pot’, /koili baikuNTha/ ‘temple 
Vaikuntha’, /chera: pahara:/ ‘dirt-sweeping’, etc. 


My contention is that /nitia:ni/ is also a Dravidian word. It should 
be recalled that during his visit to Puri (at the end of the 11th century or 
the beginning of the 12th century), Ramanuja (1056-1137), the great 
Vaishnava theologist of the Chola empire, could not succeed in getting 
Brahmin priests appointed for the worship of Lord Jagannatha. From 
this, it is evident that Savaras or Daitas were worshipping J agannatha at 
that time (Sahu 1996:37-38). In other words, until then the Munda 
culture was dominant in Puri as well as in the temple of Jagannatha. So 
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it is obvious that, along with other customs, the use of milk by the 
Dravidian Gangas must have been detested by the people, and there 
must also have been a strong resistance to its introduction in the 
temple. It could have been one of the reasons which forced 
Chodaganga, who was a /paramama:heswara/ ‘staunch Saivite’ on his 
own admission in 1112, to proclaim himself a /paramabhaTTa:rakah- 
paramavaiS Navah-paramabra:hmaNya/ ‘staunch Saivite, staunch 
Vaishnavite, and staunch Brahmanical’ in 1135 (Rajaguru 1972:40). 
Thus, finally both the camps most probably settled for a compromise, 
and in this process milk was renamed as /nitia:ni/. In fact, it is derived 
through the combination of two Dravidian words /ney/ ‘ghee’ and 
/taNNi/ ‘water’ as follows: 


ney + taNNi > nita:ni > nitia:ni 
(literally meaning ‘ghee water’) 


5.3. Inclusive-Exclusive Distinction in the Oriya Verb 


There is another telling example of the Dravidian influence on 
Oriya, i.e. the inclusive-exclusive distinction in the finite verb when the 
subject is the first person plural pronoun. It should be mentioned here 
that Dravidian languages use two different first person plural pronouns 
out of which one includes and the other excludes hearer(s). Consider 
the following Telugu examples: 


(1) me:mu annam tiNTa:mu 
we (excluding the hearer(s)) rice will eat 
‘We (excluding the hearer(s)) will eat rice.’ 


(2) manamu annam tiNTa:mu 
we (including the hearer(s)) rice will eat 


‘We (including the hearer(s)) will eat rice.’ 


In (1) /me:mu/ excludes the hearer(s) whereas in (2) /manamu/ 
includes the hearer(s). Though some NIA languages, like Marathi and 
Gujarati, show a similar inclusive-exclusive distinction in the first 
person plural pronouns, this characteristic is noticed in the finite verbs 
in Oriya. 


It is well known that pronouns form a closed system which is 
resistant to changes. So it is obvious that when Oriya acquired the said 
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feature from Dravidian, it could not make suitable modifications in the 
existing pronominal system to accommodate the change. Rather what it 
did was that a change was brought about on the inflectional of verbs; as 
a result of which the following structures appeared in this language: 


(3) a:me jac:uchu 
we are going (excl.) 
‘We are going (without the hearer(s)).’ 


(4) a:me ja:uche 
we are going (inclu.) 


‘We are going (with the hearer(s)).’ 
Thus, this is a Dravidian characteristic retained in Oriya. 


The other characteristic noticed in Oriya and Bengali is to use 
the verb /a:s-/ ‘to come’ while going. I have seen North Indians and 
foreigners getting perplexed when Oriya and Bengali speakers use "I 
am coming” when actually they take leave. This feature is found in 
almost all Dravidian languages. So it is another Dravidian feature found 
in Indo-Aryan languages like Oriya and Bengali. 


5.4. Number Markers in Oriya and Bengali 


With reference to its number markers, Oriya uses two mutually 
exclusive number markers, i.e. /-mma:ne/ and /-guDa:/ very frequently. 
The former is attached predominantly to human nouns whereas the 
latter, to non-human nouns. The following examples are illustrative: 


(5) azi mo bha:zima:ne a:sichanti 
today my brothers have come 


‘My brothers have come today.’ 


(6) ayi mo bahiguDa: a:sichi 
today my books has come 


‘My books have arrived today.’ 


It should be pointed out here that the inanimate plural marker 
/-guDa:/ was not there in Old Oriya. It is attested very rarely in Sa:raLa 
Maha-bha-rata, the first major composition in Oriya written by Sarala 
Das in 15th century. So it can be accepted as a later interpolation. But 
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the distinction between human and non-human plural markers is found 
in Sa:raLa: Maha:bha.rata. It predominantly attaches /-ma:ne/ to human 
bases and /-ma:na/ to non-human bases. In most of the cases where 
/-ma:ne/ and /ma:na/ are used with non-human and human bases 
respectively, the cause is the rhyme-scheme; <i in a very few cases it 1s 
due to carelessness of the scribes. 


However, in the course of time /-guDa./ has replaced /-ma-na/ as 
a non-human plural marker, and it is a recent phenomenon. Though the 
use of /-ma:ne/ with human bases and /-guRa:/ with non-human bases 1s 
the norm in Modern Oriya, sometimes these can be used with 
non-human and human bases respectively. For example: 


(7) ga:ima:ne a:suchanti 
COWS are coming 


‘Cows are coming.’ 


(8) pila:guDa: a:suchanti 
children are coming 


‘Children are coming.’ 


In fact, here /-ma:na/ and /-guDa:/ are used deliberately to 
promote /ga:i/ ‘cow to [+human] level and demote /pila:/ ‘child’ to 
[-human] level. In other words, /ga-i/ ‘cow gains respect with the 
attachment of /-ma:ne/ whereas attachment of /-guDa:/ expresses a 
sense of contempt for /pila:/ ‘child’. 


Though the source of /-ma:ne/ is not resolved yet, I have little 
doubt that /-guDa:/ is Dravidian. It is a cognate of /-kaL/ in Tamil and 
Malayalam, /-gaLu/ in Kannada, /-kulu/ in Telugu, and /-ku/ ~ gul/ in 
Parji, Kolami, Gadaba, etc. (Andronov 2001:116, Krishnamurti 
2003:214-215). Chatterji (1970:97) has tried to derive the Bengali 
non-human plural marker /-gu/a/ from /-kula/ of Sanskrit while denying 
any connection between these plural markers in Bengali and Dravidian 
(ibid.: 733). But Sen (2004:213) was sceptical regarding Chatterji’s 
source and that is why, he has derived it from /go/a:/. But there is hardly 
any convincing evidence to support these views. On the other hand, like 
Oriya /-guDa:/, it can easily be related to the above mentioned 
Dravidian plural maker. 


Bengali has two other human plural markers /-ra:/ and /-der/, 
which are attached to the nominative and oblique forms of nouns and 
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pronouns (e.g. /a:mra:/ ‘we’, /a:ma:der/ ‘our’). As in the case of /-gula/, 
Chatterji (1970: 737) has stated: "The evolution of the <-r-a., -e:r-a:> 
affix for the plural is comparatively a late thing in Bengali, and it cannot 
be in its origin a Dravidian borrowing, nor should it be compared with 
the Dravidian plural <-ar, -a:r>found in Tamil as well as in the 
Dravidian speeches adjoining Bengali, viz., Oraon and Maler." It should 
be noted here that, like the Dravidian suffix, (e.g. Telugu /ba:lu + mu/ 
‘boys’, Kui /a:ba+ru/ ‘fathers’(Andronov 2001: 113), Kannada 
/aras+ar/ ‘kings’, Gondi /kaND+ir/ ‘boys, Pengo /ToND + ar/ 
‘brothers’, Kolami /ma:s+ur/ ‘men’, Parji /tol+er/ ‘brothers’, Ollari 
jil+er/ ‘bridegrooms’, Kurux /a:l+ar/ ‘men’ (Krishnamurti 
2003:214-215), the Bengali /-ra:/ is a human plural suffix. Similarly, /-der/ 
is also a plural suffix in Kannada. Let us take the followmg examples 
(Andronov 2001: 113) from different Kannada dialects: 


Singular Plural Gloss 


ma:ma:  ma:ma:deru mother’s brother (Gulbarga) 
aNNa aNNadi:ru brother (Halakki) 
akka akkadi:ru sister (Nanjangud) 


It is not at all a coincidence that the Oriya /-guDa:/ and these 
Bengali plural markers appeared in the concerned languages around 
the same time. In fact, during 14th and 17th centuries a number of 
Dravidian characteristics appeared in Oriya and Bengali and it is a 
matter of further research to determine why and how it happened in 
that period. Sen (1971:vi) has categorically rejected any Dravidian 
influence on Bengali: "It is now usual in linguistic treatises on 
Indo-Aryan to hunt for a Dravidian source. I have not done so for the 
simple reason that, with the exception of one or two (such as the name 
Balla:la) no word in early Bengali could possibly be assigned directly to 
Dravidian." However, Chatterji (1970:64) has accepted Dravidian 
influence on Bengali as he states: "There is, of course, the presence of 
the Ko:] and Dravidian speakers (the Santals, the Maler, the Oraons) in 
the western fringes of the Bengali area...There are, again, some 
unmistakably Dravidian affinities in Bengali phonetics, morphology, 
syntax and vocabulary: but these agreements with Dravidian are not 
confined to Bengali alone, but are found in other NIA. also." 
Considering all these, it is more reasonable to accept all these Bengali 
plural suffixes to have been taken from Dravidian. 
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5.5. The Mother’s Brother in Indian Languages 


The kin term for ‘mother’s brother’ is an interesting example in 
many Indian languages. Sanskrit uses /ma:tula/ for mother’s brother 
and it is different from both /ma:ta:maha/ ‘maternal grandfather’ and 
/pita:maha/ ‘paternal grandfather’. Greek has /me-tro.s/ for ‘mother’s 
brother’ which is an analogical creation after /pa’tro:s/ ‘father’s brother’. 
According to Benveniste (1973:212), Sanskrit /ma-tula/ and Greek 
/me:tro:s/ "are recent substitutes for an Indo-European designation 
which disappeared when the mother’s brother ceased to have a 
privileged position with respect to the father." It is sigmficant and 
interesting to note that in the Indian society, mother’s brother is the 
dearest kin for all people and he is the one who always helps his sister’s 
children at the time of the latter’s needs. Krishna’s bringing up of his 
sister Subhadra’s son, Abhimanyu in the great Indian epic 
Maha:bha:rata is a good example of it. So it can be argued that the 
cordial relationship between mother’s brother and sister’s son is at least 
as old as the Maha:bha:rata. It should be taken note of here that on this 
account the society represented by Sanskrit is different from the 
societies represented by other Indo-European languages. In other 
words, it is an innovation in the Indo-Aryan society. But the question 
that one can ask is: What is the source of this innovation? We will get 
the answer if we look at the words for mother’s brother in other Indian 
languages belonging to the Indo-Aryan, Dravidian and Munda stocks. 
These languages normally use /ma:ma:/ or its variants for mother’s 
brother. Let us take examples from two languages of each group. 


Indo-Aryan Dravidian Munda 


Oriya: ma:mu Telugu: ma:ma Bonda: ma:mu, ma:mun 
Hindi: ma:ma: Tamil: ma:ma:,ma:man  Kharia: ma:mu 


Among the Tibeto-Burman group of languages, some use the 
term derived from /ma:ma:/ whereas others use different terms. For 
example, Bodo uses /ma:y/, obviously due to its close contact with 
Assamese; but Aka, spoken in Arunachal Pradesh, uses /as/ for 
maternal uncle. What is interesting is that except Indo-Aryan, in all 
other stocks, the word for mother’s brother also denotes "father’s 
sister’s husband’. Only in Dravidian it refers to ‘spouse’s father’ along 
with ‘mother’s brother’ and ‘father’s sister’s husband’. Thus, a hierarchy 
can be established among these stocks with reference to the meanings 
_ of the above said word. 
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Dravidian Munda/Tibeto-Burman Indo-Aryan 


mother’s brother = mother’s brother = mother’s brother 
father’s sister’s husband father’s sister’s husband 
= spouse’s father 


In other words, the Dravidian society allows exchange of sisters 
and cross-cousin marriage; which can be represented diagrammatically 
as follows: 


FaSi <—————-- Fa ———— _- Mo == _~MoBr 


X -———> Y 


The Munda and the Tibeto-Burman societies allow only the 
exchange of sisters. Thus, they form a part of the Dravidian system, and 
it can be schematised as follows: 


FaSi —————- Fa = -—Mo=== ~IMoBr 


Finally, the Indo-Aryan society allows neither of these. 


It will not be out of place to discuss an important aspect of the 
practice of marriage among the Dravidians. Even though a Dravidian 
male has an option to marry either his mother’s brother’s daughter or 
his father’s sister’s daughter, the former is invariably preferred. The 
question is: Why is it so? We all know that giving a daughter in marriage 
is considered to be an act of religious merit for her parents or those 
who officiate on their behalf. This is popularly known as kanya:-da:na 
or ‘bride-gift’ which was originally practised by the Brahmins only and 
its sole motive was to earn dharma or piety. Later it spread to other 
castes through the Brahminisation process. Thus, it is clearly a religious 
gift and "... the bride’s people must accept nothing in return; the 
slightest gift from the groom’s side would be construed as the "price" of 
the daughter, to accept which would constitute sale. The groom and his 
people can expect gifts, hospitality and acts of deference from the 
bride’s people, but must not reciprocate. ...The groom’s party are the 
superiors and benefactors of the bride’s." (Trautman 1995: 292). For 
this reason, it is considered unethical and irreligious on the part of the 
bride-givers to accept anything from the bride-takers. Many 
bride-givers do not drink water in the bridegroom’s house even today. 
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On the other hand, they regularly keep sending gifts and presents to the 
latter and treat him and his family members with humility and respect. 
In fact, "The idiom of kanya:da-na is the patrilineal idiom of complete 
dissimilation of the bride from her family of birth and her complete 
assimilation to that of her husband." (Trautman 1995:291). Though it is 
a typical Indo-European custom inherited by the Indo-Aryans, it 1s 
prevalent especially among the upper caste Hindus all over India 
including the Dravidians. Actually, as pointed out by Trautmann 
(1979:170), the root of preference for mother’s brother’s daughter over 
father’s sister’s daughter lies here. Because after one’s sister is given in 
marriage to another person, presents and hospitality always flow from 
the former to the latter. Then, if the former’s son marries the latter’s 
daughter the flow has to be reversed; and this leads to a conflict of 
interests. On the other hand, if one marries one’s mother’s brother’s 
daughter the question of such a conflict does not arise as the flow of 
presents and respect which were coming from the latter for his sister 
earlier will flow to his daughter. This is the most obvious reason for 
which the Dravidian males prefer to marry their mothers’ brothers’ 
daughters to their fathers’ sisters’ daughters. 


The other point to be noted here is that all the Dravidian 
languages use /ma:ma:/ or its variants. The present Indo-Aryan 
languages also use the same instead of the Sanskrit /ma:tula/ or any 
term derived from it. So it can be argued that Indo-Aryan has taken the 
term for maternal uncle from Dravidian, and this indicates that a 
deeper cultural convergence has taken place in this subcontinent over 
the millennia. | 


5.6. Some Verbs, Private Body Parts and Adjectives in Indo-Aryan 


In his discussion on borrowing, Anttila (1972:155) has remarked: 
"..generally the more abstract the element is, the more difficult is the 
transfer." Needless to say that verbs and adjectives are more abstract 
than nouns. A careful analysis of the Oriya vocabulary reveals that, 
there are many verbs and adjectives which have been taken from 
Dravidian, e.g. /kumL-/ ‘to be extremely happy (DEDR 1774), 
/kuRha.-/ ‘to make a heap’ (DEDR 1882), /kheNTa:-/ ‘to backbite’ 
(DEDR 1948), /kuT-/ ‘to pound’ (DEDR 2063), /kor-/ ‘to scrape’ 
(DEDR 2257), /cakaT-/ ‘to knead’ (DEDR 2266), /cank-/ ‘to move’ 
(DEDR 2281), /chec-/ ‘to crush’ (DEDR 2322), /sa:kula:-/ ‘to console’ 
(DEDR 2427), /jhul-/ ‘to swing’ (DEDR 2889), /bhiN-/ ‘to card 
-(cotton)’ (DEDR 4160), /peR-/ ‘to squeeze’ (DEDR 4231), /pot-/ ‘to 
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bury (DEDR 4509), etc. Verb roots like /kuT-/, /jhul-/ are also used in 
other NIA languages, like Bengali and Hindi. Then, the Oriya verb 
/geh-/ (DEDR 1957) ‘to copulate’ is derived from Dravidian /key-/ ‘to 
do’, which is also used in the same sense in many Dravidian languages 
(see Kuiper 1958:239). Another Oriya verb root /gaLa:-/, which has a 
cognate in Dravidian (DEDR 1299), is also used in the same sense. Not 
only that, the names of some private body parts used in Indo-Aryan 
have their sources in Dravidian, e.g. Oriya /ga:NDi/ ‘buttocks’ and 
/ba:NDa/ ‘penis’. Turner (1966) has derived these words from Sanskrit 
/gaNDa/ ‘goitre, boil’ (3997, p. 213) and /baNDa/ ‘maimed (in hands, 
feet or tail), impotent’ (9124,- p.514) respectively. These seem just 
chance similarities to me, because the meanings do not match at all. On 
the other hand, there are cognates available in Dravidian that are quite 
comparable from both phonological as well as semantic points of view. 
Let us take examples from Telugu: /gaNDi/ ‘hole, orifice, breach, gap, 
lane’ (DEDR 1176) and /baNDa/ ‘obscene, shameless’ (DEDR 3902). 
Both these words are also used in Bengali whereas only the former is 
used in Hindi, Maithili, Gujarati, Marathi and Nepali. I should state 
here that different Oriya dialects use other words for private body-parts 
whose sources are Dravidian, e.g. /kuLi/ (DEDR 2244) and /suna:/ 
(DEDR 2666) both used to refer to a male child’s genital in Northern 
Oriya. Besides, most standard as well as dialectal Oriya words for 
‘small’, e.g. /sad:na/ (DEDR 2594), /kaTi/ (DEDR 1851), /kuni/ (DEDR 
1646) are taken from Dravidian. Even Nepali uses /sa:na/ for ‘small’. 
Then there is a Dravidian language called Dhangar spoken in Nepal. 
All these hint at the extent to which the Dravidian languages had 
spread in earlier times. Again, presence of more Dravidian words in the 
dialects than those in Standard Oriya provides a strong clue that 
dialects and non-standard varieties carry and retain more convergence 
features than the standard language. 


5.7. Dravidian Writing System in Oriya, Bengali, Assamese and Maithili 


Let us now discuss the alphabets of languages like Oriya, 
Bengali, Assamese and Maithili (Tirhuta). But before that, it will not be 
out of place to say a few words regarding the round shape of Oriya 
letters. The dominant view on this issue is that Oriya was written on 
palm-leaf with an iron stylus in the initial stage. Use of a straight line on 
the top of these letters, ike Devanagari, would have caused the 
palm-leaf to slit. So in order to avoid this damage these letters have 

been given a round shape. But I have argued elsewhere (Mohanty 1996) 
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that this position is not tenable because of two reasons: First, in the 
initial stage of its development, Oriya writing took place on stone pieces 
and palm-leaf was used later. Secondly, in olden days palm-leaf was 
also used for writing Bengali and Assamese; but, unlike Oriya, the 
shape of Bengali and Assamese letters are not round. 


It is significant to mention here that most of the early Oriya 
inscriptions were written in the Telugu script. So I want to argue that 
the Oriya script is round shaped not because of the iron stylus and 
palm-leaf, but because of Dravidian influence. This view gets strong 
support from the grammar of Oriya writing. For example, let us 
consider writing of consonant clusters. When such a cluster is written in 
the Devanagari script, the first letter is usually written half and the 
second one, full, e.g, %4e (/sk/), GF (/ND/), FU (/jy/), etc. Notice 
that in the first cluster-& is written partly (full =) whereas => is fully 
written. Similarly V and <~ are part letters (full = U7 and 337) but 
and 2 are full in the second and third clusters. But it is just the 
reverse in Oriya, Bengali, Assamese and Maithili (Tirhuta). In fact, in 
all these languages, the first letter is written full and for the next either 
an allograph is used or it is written below the first one in small size in 
case it does not have an allograph. For example, if we write the above 
three clusters in Oriya, they will look like gf, 9 > GY 
respectively. It needs to be stated here that consonant clusters in 
Telugu and Kannada are written exactly in the same way. Let us take 
Telugu examples: Dees bb, % we Thus, the eastern NIA languages 
mentioned above use the Telugu-Kannada system of writing, not that of 
Sanskrit or Hindi. 


5.8. Compulsion behind Purity of Tamil 


Finally, a word about the purity of Tamil. It has been claimed to 
be a ‘pure’ Dravidian language because it possesses very limited 
Sanskrit loanwords vis-a-vis Telugu, Kannada and Malayalam. Though. 
scholars (e.g. Burrow 1968: 300) have expressed their surprise at the 
small number of Sanskrit borrowings in the early Tamil texts, I am not 
aware if anyone of them has discussed its cause. I have argued 
elsewhere that the contact between Tamil and Sanskrit was 
predominantly literary and that the structure of Tamil was such that it 
could not have borrowed freely from Sanskrit. We know that "...the 
phonological structure of Tamil was (and still is) conspicuously 
different from that of Sanskrit. Tamil did not possess aspirated and 
voiced consonants (and obviously letters to represent them), consonant 
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clusters, and no word in it began with /r/ and /// when all these were 
quite common in Sanskrit." (Mohanty 2000:74). Therefore, writing 
Sanskrit words with less number of letters, with the restriction of not 
using /r/ and /l/ word-initially, and without consonant clusters, was 
certainly an unmanageable task for the literate Tamilians at that time. 
All these naturally lead us to believe that Tamil has less synchronic 
variation and more polysemy. Again, due to its polysemous nature, 
Tamil should be more difficult for others to learn than its sister 
languages. These observations, however, need to be substantiated with 
facts and figures. 


6. Conclusion 


To conclude, the following points have been made in this 
presentation: First, language is a very important tool to reconstruct the 
culture and history of the past. Second, Indianisation or nativisation of 
the Sanskrit language and culture in all likelihood started from Central 
India, and Dravidian influence played key role in it. Third, presence of 
Dravidian grammatical characteristics as well as lexical items covering 
verbs, adjectives and nouns in the Indo-Aryan languages starting from 
Oriya, Bengali, Hindi to Nepali shows the wide range of Dravidian 
substratum in the Indo-Aryan speaking area. Fourth, writing of 
consonant clusters in the Indo-Aryan languages like Oriya, Bengali, 
Assamese, Maithili (Tirhuta) follows the Dravidian system. Fifth, more 
frequent use of Dravidian words in the dialects than m Standard Oriya 
gives a clue to hypothesise that dialects carry more convergence 
features than their standard counterparts. Sixth, since Oriya possesses 
more Dravidian characteristics than Bengali and Hindi, it seems the 
Orissan region acted as a corridor through which convergence features 
migrated from Dravidian to various Indo-Aryan languages. 
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BASAVESWARA, TOLSTOY AND KUVEMPU 
A COMPARATIVE STUDY 


D. JAVARE GOWDA 


Part I 


I am highly grateful to Prof. V.I. Subramoniam, Honorary 
Professor of International School of Dravidian Linguistics, for inviting 
me to deliver the Endowment Lecture on the occasion of the Annual 
Conference of Dravidian Linguists. 


Lives of great men are lighthouses of mankind. They strive hard 
and incessantly for the good of others, disregarding their personal 
comforts and happiness. They knew very well that individual happiness 
is contained in the happiness of the entire community. They believed in 
the supremacy of the soul-force over brute-force. They knew very well 
that money is a curse and mundane luxury is a sin. Their life was 
dedicated for the achievement of moral and spiritual values. Sacrifice 
and suffering for the eradication of evil and the establishment of the 
Kingdom of God on earth is their sole intention. They are the 
embodiment of pity, compassion, mercy, humanitarianism, egalitarianism 
and love. They dedicate their lives for the search of God and Truth and 
for the service of humanity. A comparative look at the lives of these 
three great men, viz. Basaveswara, Tolstoy and Kuvempu (K.V. 
Puttappa) will illustrate the above points. 


These three extraordinary geniuses who are known for religious 
pursuits, humanism and egalitarianism belong to different times and 
climes. They not only represent the spirit and the values of their age but 
also sincerely and assiduously endeavour for the elevation and spread 
of universal values, which are the outcome of their rational thinking and 


* Endowment lecture delivered at the 35th A.l.C.D.L. held at the University of 
Mysore from 21-23 June 2007. 
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search for truth. Precept and practice are identical according to these. 
Universal peace and happiness are the main aims of their teaching. 


Basaveswara was born in the year 1131 at Bagewadi situated in 
Byapur district of Karnataka to orthodox Saiva Brahman parents. He 
was precocious child, endowed with extraordinary power of intellect, 
insight and prophetic vision. He had learnt Vedas and Upanishads at 
the age of eight and developed rational thinking. He was stricken with 
awe when he saw around him dreadful and horrid social evils and even 
gender discrimination with regard to religious rites. Refusing to 
undergo thread ceremony therefore, he relinquished his house, went to 
Kudala Sangama with his sister Akkanagamma, stayed with Jatavéda 
muni as his disciple. He never turned his sight again towards his 
hereditary habitat. In an atmosphere of peace and sacredness, he 
continued his studies, devoted himself to meditation, prayer and 
spiritual practice for nearly fourteen years and attained spiritual bliss. 


Persuaded by Jataveda muni, he married his own maternal uncle 
Baladeva’s daughter, Gangambika. Baladeva was the Chief Minister of 
Biyjala, a Kalachuri King at Kalya. Bijjala appointed Basaveswara as his 
Finance Officer and gave his foster daughter Nilambika (daughter of 
Siddarasa who was also a minister) to him in marriage. 


Basaveswara, the visionary and a great thinker, decided to utilize 
his position for the well-being of humanity and to translate his ideas, 
ideals, thoughts and dreams into action, to eradicate all the inhuman 
social evils which were detrimental to the progress of the country and to 
evolve a new religion whose sole aim was to develop an egalitarian 
society. The news of his spiritual greatness and his determination to 
create a new society free from any kind of discrimination, spread over 
four corners of the country wherefrom thousands of disciples gathered 
in Kalyana for his blessing and guidance. Once he came to Kalyana, 
Basaveswara never seems to have left the place; he attracted people 
from all over India. Basaveswara without fear or favour continued his 
mission with zeal and zest for nearly ten years till his death in 1167. His 
conception of total revolution to redeem a country beset with 
multitudinous problems is a rare phenomenon. 


The fourth and the youngest son of his parents Leo Tolstoy lived 
mostly in Yasnaya Polyana for eighty-two years from 1828 to 1910. He 
lost his parents during his childhood and came under the tutelage of 
two aunts. He inherited the estate of Yasnaya Polyana in Central 
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Russia from hgs mother’s family. Yasnaya Polyana was inseparable from 
him throughout his life. It was so sacred and inseparable from his being 
that it had become the part of his mental life and that he used to carry 
its impressions wherever he went. Even when he visited Moscow for a 
temporary stay, he used to hanker for his return every night. Though his 
material body was in Moscow, his subtle body used to be in Yasnaya 
Polyana always. Yasnaya Polyania was for him a spiritual haven, a way of 
life, a sacred refuge and without it Russia was nothing to him. 


Since he belonged to a titled family with a big estate, he was 
highly conscious of his rank as a young man and never slipped into the 
company of the intelligentsia of those days since they had no ties with 
the soil. In fact, he liked solitude during his younger days. 


His studies at Peterburg and Kazan universities for a few years 
were not satisfactory, but he was voracious reader and had the 
unquenchable thirst for knowledge. He had studied most of the 
well-known authors, both western and eastern philosophy, with avidity 
and_ sincerity. With patriotic fervour, he accompanied his 
soldier-brother Nikolay to Caucasus in 1851 and joined the army as a 
cadet. 


Even when he was in the army for about five to six years, his 
reading was tireless, intense and copious. His thirst for knowledge 
never waned and his capacity for original thinking increased day by day. 


During his military service, he led a very despicable and 
lascivious life. Gambling, drinking and sex were his routine crimes. At 
the same time, he started writing. In the beginning itself, he was 
confident of producing good literature. The dairies he has been 
maintaining since his student career is a mirror to his mind and heart, 
insight into human nature, his intellectual talents, emotional outbursts 
and spiritual revelations. The recordings of his daily experiences are 
very objective but his weaknesses and sins are sometimes exaggerated. 
One of the dairies records his aspiration to found a new religion, 
rational Christianity, removing mystery, dogmas and fanciful stories. 


He did not travel outside Russia extensively except twice for 
short periods, when he visited France, England and Germany. Even 
after his marriage with Sonya at the age of thirty-four, his search for 
truth was unabated. He is appalled at the poverty and hunger of 
peasants though they did work hard. In spite of his wife’s protests, he 
starts living like a peasant and attempts to do away with serfdom. He 
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establishes a school in his village for the education of the children of his 
serfs and he himself teaches them. He incurs the displeasure of the Tsar 
by proclaiming that the government and its organs are the agents of 
violence which should be avoided if the kingdom of God has to be 
established on earth. It is really astonishing that a sinner becomes a 
great renowned saint through prayer, sacrifice, renunciation, restless 
endeavour and service to humanity. Being forlorn due to 
non-achievement of his ideas, coupled with his wife’s indifference, he 
leaves his home with a friend without informing anybody and meets his 
tragic death about fifty miles away from his dear village Ydasnaya 
Polyana. . 


Kuvempu was born in 1904 and brought up in an undivided 
farmer’s family of Kuppalli, a small hamlet, with only one house 
perched in the hilly region of Shimoga district amidst nature’s plenty. 
After completing elementary education in his village and Tirthahally, 
he went to Mysore for further studies. More than what he learnt at 
school, he learnt a lot of things, especially the divine message from 
nature which was his spiritual guru and guide. Nature became an 
indivisible vital part of his life and soul till the end of his life. 


The Mysore city Public Library became his haunting temple of 
learning where he gobbled up all the major works of English and 
European literature. He felt that a kind of blissful metamorphosis was 
taking place in his life and soul when he read the teaching of Sri 
Ramakrishna Paramahamsa and Swami Vivekananda. These two sages, 
the benediction of nature and the great books he read, roused his 
dormant faculties and had a tremendous influence throughout his life. 
He composed several hundreds of poems in English. When he showed 
them to G.H. Cousins, an Irish poet, during his visit to Mysore, Cousins 
advised him to write in Kannada. Kuvempu followed his advice and it 
was a turning point in the literary career of Kuvempu. 


For more than twelve years, he lived in Sri Ramakrishna 
Ashrama along with the sanyasis in response to the invitation from 
Swami Siddeswarananda whom Kuvempu considers as his guide and 
philosopher. His association with the Ashrama gave impetus to his 
thirst for mastering Indian philosophy. He was well-versed in western 
philosophy as it was his special subject for the bachelor’s degree. 


This philosopher-poet was a prolific writer. His multi-faceted 
genius has not left any literary form untouched superbly. His 
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contribution to Kannada literature include lyrics, narrative poems, 
epigrams, odes, sonnets, prose-poems, short stories, novels, 
biographies, essays, lectures, one-act plays and serious dramas and a 
multidimensional epic. The grand epic is the result of his life-long 
tapasya. He has enriched Kannada literature with novel themes, variety 
of experiences, revolutionary ideas, spiritual visions, dynamic imagery, 
rare and unusual similes and metaphors and profound thoughts. He has 
invented his own style to befit his themes, thoughts and characters. 


His first love was literature and philosophy and his second love 
was teaching. Starting his career as a lecturer in Kannada in the 
University of Mysore in the year 1929 and gradually mounting the 
ladder of hierarchy, he became Vice-Chancellor of the University of 
Mysore. With multi-sided developments, the period of his regime is 
rightly hailed as the golden period of the University. Though his first 
and foremost concern is spiritual development, he is a keen observer of 
the society. He says, "whenever I am at home or elsewhere, I am a 
constant traveller on earth or heaven". With indomitable courage as he 
. fought against superstitions, priestcraft, corruption, moral bankruptcy 
and myriad other social and political evils. He is of the firm opinion that 
spiritual practice and rational outlook are not contradictory. Like 
Basaveswara and Tolstoy, he is also a consummate teacher and 
educationist and he does not preach what he does not practise. 


Part II 
A. Religion and Priestcraft 


This trio believed that religion was the panacea for all evils, 
unhappiness, distress and mutual distrust among individuals in the 
society. But it should be rational and scientific, devoid of superstitions, 
dogmas and fanciful stories. They waged a fierce and relentless battle 
against superstitions and dogmas till the end of their lives. 


God is the source, strength and the mainstay of religion or 
Dharma. The trio had uncompromising and abundant faith in God. 
According to them, God, the almighty, is only one, but with varied 
forms. They do not hesitate to deny the multiplicity of godhead. Though 
they did not deny the use of symbols like the cross, the /ingam and the 

light, they worshipped mainly the formless, changeless, omnipotent 
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God. They adored God through prayer, meditation and penance. This 
adoration was the indivisible part of their lives. 


For them, truth was God. They were dedicated to the search for 
truth and it was the goal of their lives. They nevér shrunk back from 
asserting the truth, even in the face of danger. Religion divorced from 
truth and morals is heresy. Religion founded on the basis of morality is 
a boon to world peace. Basaveswara says that compassion, mercy, 
kindness and love are the characteristics of good religion. Tolstoy 
asserts that where love is, God is. 


The trio were disgusted with what was happening in the 
orthodox religious churches and temples. The priestly-class used to 
interpret religious dogmas and the teaching of original masters, 
prophets and founders to suit their own selfish ends which were 
detrimental to peace, the well-being and the progress of society. In fact, 
they foisted concocted rituals, customs and superstitions on the people 
and became religious dictators and cruel exploiters of society. 
Basaveswara, the spiritual giant, the great seer, who was so far 
unsurpassed with regard to rational thinking and revolutionary ideas, 
rebelled against the existing orthodox religious books and codes which 
engendered all kinds of social evils. He swears in the’ name of Kudala 
Sangama in the following strain: 


I shall rub off the Vedas 
I shall bind the Sastras with fetters 
I shall tear the spine of Logic 
I shall -chop off the nose of Agamas 


He did not see any hope for the society unless the old scriptures 
were completely annihilated. He found and fostered a new religion 
(Lingayitism) which was supposed to combine humanistic, socialistic 
and egalitarian ideas. 


Tolstoy genuinely trusts God and the teachings of Christ. He 
firmly believes that the church distorts the teachings of Christ. He 
condemns the church for encouraging dogmas like redemption, for 
associating with political powers in perpetrating evils and many such 
things. His allegiance to Christ as a moral teacher was genuine. He 
upholds the Sermon on the Mount. He only regrets that the orthodox 
church was guilty of obscurantism and repression. He feels sincerely 
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that the priestly-class is responsible for this malaise. He declares that he 
adores Christianity not Churchianity. A curious thought occurred to 
him once that he should found a new religion based on rational, simple 
and scientific principles, avoiding all mystery, dogma and rituals. His 
resolute attitude to orthodox Christianity invites excommunication by 
the Holy Synod. 


Like Basaveswara and Tolstoy, Kuvempu strongly opposed the 
priestcraft. He is of the firm opinion that priesthood has been 
responsible for all-round regression and social diseases in this country. 
Righteous indignation expresses itself violently in his speech and 
writings. Throughout his life, he has battled incessantly against 
superstitions, dogmas, meaningless rituals, corruption and moral 
degeneracy. He is a spiritual genius with a scientific attitude and 
rational outlook. 


Both Basaveswara and Kuvempu discourage temple culture 
since they are places of corruption, exploitation, superstition and 
trickery. 


Two of his dramas, viz. Jalagara (‘Scavenger’) and Siidra 
Tapasvi, expose the social evils existing in society and the cruelties of 
Brahmans against Sidra. Jalagara is an allegorical play and the 
principal character is the symbolic representative of untouchables and 
he exposes the pretensions of upper-class people. The purpose of Siidra 
Tapasvi is to contravene the Vedic dictum that Siidras are not eligible 
to perform penance. He raised his voice against the tyranny of 
priestcraft even in his novels. 


In one of his epigrams, the poet makes an attack against both the 
priest and the King: 


Unless the intestine of the last priest 
Becomes a noose to the neck of the last King 


The heaven of freedom is only a mirage. 


B. Relationship with King 


Tolstoy would have embraced Kuvempu if only he had listened 
to his epigram. All the three persons stood for individual freedom. They 
knew very well that it would not be possible to enjoy full freedom under 
sovereignty. Basaveswara was a minister in Bijala’s cabinet. As long as 
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the King reposed unfailing, ungrudging, unique confidence in him, he 
served him and the people sincerely. Bijjala also did not interfere with 
his revolutionary type of religious and social activities. With thousands 
of followers from all over the country gathered around Basaveswara 
and people starting to adore him, the courtiers became jealous of him 
and carried malicious propaganda against him, and Bijjala began to 
suspect him. As was wont with him, Basaveswara who was a worshipper 
of love, resigned his job without any protest. 


Kuvempu was a servant in the King’s government. Being a poet 
and a spiritual aspirant, he kept aloof from the palace and its associates. 
He wrote that "a cuckoo which sings in the spring season does not crave 
for the title from the King". He does not have any obligation for th 
King. : 


Tolstoy was not in any way connected with the emperor Tsar, 
except that the hereditary title "Count" was attached to his name, but he 
had aversion to any kind of government (including church government) 
which concentrates power and maintains police, jails and army which 
embody violence. All these saint-like persons believed firmly that 
non-violence based on love alone was the only solution for all the evils 
and the problems the world was facing. | 


And love! Yes, the concept of love as visualized by Tolstoy is 
something unique and magnificent. According to him, non-resistance to 
evil is the basis, the source and the dynamic force of love, and when it 
pervades the whole world, the world will become the Kingdom of 
Heaven. That is the motive force of religion even according to the 
philosophy of Basaveswara and Kuvempu. 


The doctrine of non-resistance is based on the aphorism of the 
Apostle St. Matthew: "Resist Not Evil". Resistance will lead to 
vengeance which in turn will spread hatred. Non-resistance will breed 
love and love alone. All the three knew that the soul-force, not the 
brute-force was the divine law. Tolstoy was certain that violence was 
incompatible with love. 


C. Common Man, Peasants and Rich People 


All of them have a deep concern for the common man. They feel 
sad that when the poor people along with their children suffer from 
hunger, the rich people live in luxury, wasting things nonchalantly. In 
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fact, they dislike the rich people who are merciless towards the poor. 
Tolstoy accuses all the rich, avaricious people for their wrong way of 
life quoting Christ’s words that a camel can more easily go through a 
needle’s eye than a rich man can enter the Kingdom of God. 


He ponders that every acquisition that is disproportionate to the 
labour spent on it is dishonest like the profit made by banks and holds 
the view that the agricultural labour alone is productive. He declares 
that possession of property and money is a theft and a curse. People 
who eat without labour are sinners. He was himself fetching water to 
the house from outside. He taught himself the cobbler’s trade. He used 
to plough the field during the sowing season. He knew woodwork also. 
Bread labour, he said, was the first human law. He attacked all 
specialists who exploited others. He advocated dignity of labour and_ 
equality among all human beings. 


Basaveswara knew that the rich people were haughty and 
behaved like people possessed by the devil. He says that "one can 
induce a man bitten by a ‘snake to speak, that one can make the 
devil-possessed man speak, but it is not possible to induce a man 
possessed by the ghost of wealth to speak". 


Basaveswara’s ‘Work Philosophy (Kayaka Tattva) is deeper, 
more profound, more scientific, more comprehensive than Tolstoy’s. It 
is relevant for all countries and for all times. It upholds the: dignity of 
labour. It is endowed with economic, social and spiritual significance. 
Basaveswara pronounces that ‘work is worship’ and nobody ‘however 
’ big he is including guru’ is exempt from pursuing this rule and that 
when everybody adopts this principle, there will be no poverty on the 
earth. Kayaka along with Dasoha (the theory of distribution / sharing) 
are enough to solve all the economic problems confronting the world. 


Being born in a farmer’s. family with a rural background, 
Kuvempu is painfully acquainted with and afflicted with numerous 
troubles and tribulations, the growing poverty, a myriad of social evils 
and the cruel injustices from which the downtrodden are suffering. He © 
gives a vivid description of what he saw and felt, in his poems, stories 
and novels. Kalki is one such long poem sprinkled with abundant 
imagery, similes and metaphors, where he gives vent to the emotional 
outburst of righteous indignation. He is very much agitated at seeing 
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yawning differences and injudicious discriminations among the people 
in the city. 


The mansions of the rich dazzle on one side 
The dirty huts of the poor on the other side 
The sweet voice of songs on that side 
The incessant cry of lament on this side 
There they throw waste food for pride 
For want of food they groan and suffer here 
There they are dressed in glittering silk 


Not even a rag to cover their loin here 
The narration continues in the same style for a length of more 
than one-hundred-and-fifty lines. The poor man’s hunger turns into fire 
which emits flames all around and these flames engulf the entire 
community of rich people. The poet predicts that the poor man is Kalki, 
the incarnate who brings about the end of Kaliyuga. 


Only a very few literary giants like Tolstoy, Prem Chand and 
Kuvempu all over the world have any regard for the peasants. In 
nineteenth-century Russia, only nobles possessed all the land available 
in that country and the serfs laboured for them with hungry bellies and 
half-naked bodies. Even Tolstoy was a noble possessing thousands of 
acres of land. Towards the end of his middle age, he began to realize 
the inhuman miseries and malefic horrors the peasants were subject to 
and began to atone for the guilt as he was a part of the system. He was 
very sad to see the collaboration of landlord government and the 
landlords in treating them cruelly. He tried sincerely to ameliorate their 
miserable conditions, to free them from feudal oppression, and to 
abolish serfdom. 


In the entire Kannada literary field, there is not even a single 
writer except Kuvempu who had acquainted himself with the problems 
of the peasant. Being born in a farmer’s family, he had seen the plight of 
the farmers at close quarters and hence he had better regard, sympathy 
and compassion for them. He calls the ploughman as a yogi. His novels 
and stories contain many episodes relating to the life of farmers and 
peasants. He points out that feudalism and serfdom existed side by side 
in Karnataka also. He has composed a good number of poems 
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depicting the various aspects of a farmer’s life. A few lines from To the 
Hero of Ploughshare may be extracted here to show how much regard 
he nourishes towards them: 


Your are the Gandhi of the land 
Delight of the earth 
And the Nation’s power 


Are you inferior to Nehru 
even in patriotism? 
They are mere names; breath you are 
To the Goddess of the land. 


Kuvempu places the farmer and the peasant above all people 
living around him, irrespective of their ranks and professions. 


The trio loved this ‘earth with devotion, respect and gratitude, 
since they are born here, live here and earn salvation here. Nature is the 
part of the earth; and man is the part of nature. Nature gives him all 
comforts, including energy and bliss. The earth is not only a 
Karmabhumi (where man has to discharge his duties) and a 
Bhogabhumi (‘place of worldly pleasures’), but Yogabhumi (‘a sacred 
place for the attainment of supreme bliss’) also. In view of this, they 
thought that it was not desirable to abandon worldly life instead, they 
should lead an honest and detached life, rendering service to humanity 
and attending to religious duties. 


D. Money is a curse 


According to these saints, money is an evil and a curse. They say 
that there is a connection between gluttonous rich people and the 
hungry, wretched, miserable people. If only the wealthy people 
possessed sympathy and compassion, the conditions of the poor people 
would have been entirely different. There would not have been any 
difficulty for the people to get at least the minimum requirements. 
None of these three sages disdain the people earning money because all 
the people born on the earth will not have the same talent required for 
the acquisition of wealth. Their only objection is concerned with 
accumulation. They desire that the money got by God’s grace should be 
distributed to the unfortunate children of God. 
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E. War and its consequences 


The lure of patriotism incites Tolstoy to rush to the war-field in 
his youth. While he was serving in the war, he was saddened formidably 
and disgusted at seeing the flow of blood around him: the pile of 
corpses and tumultuous cries and the woes of forlorn relatives. After he 
returned from the war, all the horrors, the ruins and losses of war 
crowded his mind. He began to ruminate over the evil consequences of 
war and came to the conclusion that war, instead of solving the 
problems, begets more and more problems, more cruel than the earlier 
ones. Violence once let loose can never be halted except through love 
and renunciation. Ultimately, he systematises his famous doctrine, 
known as ‘non-resistance’ (‘Resist No Evil’). 


In his ‘play, Smasan Kuruksétra, Kuvempu describes 
heart-rending horrid consequences of the war. That insane, cruel 
destruction in terms of human beings, material things and cultural 
values is irretrievable is the message of the play. He says that war 
breeds hatred, causes vengeance, spreads disquiet, destroys all good 
and bad things alike and stops all kinds of progress. He admonishes 
that love which sustains life and the world is the first casualty in any war 
and with the loss of love, a demoniacal situation prevails in the world. 


Basaveswara was involved in a non-violent indirect religious war. 
Tolstoy took active part in the war. Kuvempu lived during the freedom 
movement. Though he did not participate directly in the movement, the 
pen was the real sword for him and it never missed even a single event 
during the struggle. He inspired the fighters involved in the cause of 
freedom through his poems. Whenever he recited his poems in a 
crowded assembly, thunderous shouts, claps and censures would rip the 
sky. Sometimes there used to be rumours that the government might 
warn the poet for inciting people by reading ‘seditious’ poems. 


F. Champions of Humanism 


All the three venerable wise people were great champions of 
humanism, socialism and egalitarianism. They respected religion and 
God, based on rational and scientific thinking and unsullied by dogmas 
and superstitions. The worship of God and the pursuit of religion, in 
their opinion, should not come in the way of scientific change and 
progress. They know very clearly that for all the evils existing in the 
world, the fraudulent orthodox priesthood is the root cause and a 
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menace to the peace in the society. They are, in a way, anarchists and 
revolutionaries who dedicated their entire lives to find out the alternate 
religions on a rational basis. Basaveswara founded a rational religion 
whose aim was to create a casteless society based on equality and 
dignity of labour. Though Tolstoy and Kuvempu did not establish any 
new religion, they expounded certain doctrines and principles based on 
life-long experience and search after truth and values. In a way, they 
were great humanitarian reformers whose main interest lay in uprooting 
injustices and discrimination from the society. 


G. Teachers of Humanity 


All literary people are supposed to be great teachers of 
humanity. These great literary men are extraordinary reformers who 
have been superb educators also. Great literary works like Ramayana 
and Mahabharata are eternal teachers of humanity. Therefore, all 
literary men, small or big, in one way or the other, are considered to be 
educators and reformers. However, Basaveswara, Tolstoy and 
Kuvempu are teachers par excellence. Though Basaveswara was a 
minister, he must have been, as often as possible, preaching to the 
disciples that thronged around him from all over India, the doctrines of 
Lingayat religion. When one is absorbed in his Vachanas 
(‘prose-poems’), one feels the presence of Basaveswara in front of him. 


Tolstoy became a teacher by accident whereas Kuvempu was a 
teacher by profession. Hundreds of serfs were slaving in Tolstoy's estate 
like dumb animals. He was very much agitated at the miserable plight of 
the serfs. He tried to abolish serfdom. In the meanwhile, he started a 
school for them. He himself began teaching them and wrote primers 
and tried to introduce reforms in the then-existing educational systems. 


Like many literary people, Kuvempu pursued two professions: 
literary and teaching - one for serving art and for spiritual enjoyment, 
and the other for material benefit. In his case, both were 
complementary to each other. He is a serious, profound thinker-en any 
subject of his choice. Being connected with the university education in 
different capacities from lectureship to the Vice-Chancellorship for 
more than three decades, he has rich diverse and vast experience about 
the system of education and its problems. He has written a lot about 
academic matters, several educational policies and problems and 
psychological and social topics. He gives a clarion call to the youth to 
adhere to rational thinking, to develop scientific attitude, to shun 
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superstitions, to cultivate good habits, to fight against priesthood, and 
to work hard. He has written a number of poems and essays on 
language problems. Like Basaveswara and Tolstoy, he is also a 
champion of individual freedom. | 


These three great thinkers are philosophers too whose doctrines 
contain truths which transcend geographical and time limits. Their 
actions are models to others and their utterances concerning any 
subject are prophetic. 


H. Politics 


Like Basaveswara, Tolstoy forbids taking vows which are likely 
to obstruct the free play of one’s conscience and rational thinking. 
Though Basaveswara was in the thick of politics, he does not involve 
himself too much. His social and religious activities, his occupation with 
his disciples, and his search for truth and his philosophical resignation 
towards worldly matters - all these things give relief to him from 
politics. Tolstoy and Kuvempu never indulged in politics and are farther 
away from it but they watch political events and situations keenly in a 
detached manner and express their opinions impartially and boldly for 
the well-being of the society. Like Basaveswara, they are also 
anarchists. 


I. Love of Nature 


Both Tolstoy and Kuvempu love nature, but differently. During 
his visit to Switzerland, Tolstoy says that: 


"I love nature when though it surrounds me on all sides 
and extends unendingly, I am part of it. I love it when on all sides 
I am surrounded by hot air and that same air rolls away to 
unending distance and those same sappy leaves of grass which I 
crush as I sit on them form the green of the boundless meadows, 
when those same leaves, which fluttermg in the wind run their 
shadows across my face, form also the dark blue of the distant 
forests, while the same air one breathes makes the deep, light 
blue of the immeasurable sky, when you do not exult and rejoice 

alone in nature but around you myriads of insects buzz and 
whirl, and beetles clinging together creep about and all around 
you birds overflow with song". 


1. The Life of Tolstoy, Almer Maud, Volume |, Page 188. 
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He was mostly enchanted by the external beauty of nature. He 
advised people to be in close affinity with nature, the beauty of which 
will have a durable, curing effect on their mental and physical 
behaviours. 


Kuvempu’s love of nature is entirely different from that of 
Tolstoy. He was born, cradled, caressed, nourished and brought up on 
the lap of nature. He heard the voice of nature even when he was in his 
mother’s womb. Starting from lullabies, nature taught him the secrets of 
creation and the message of the Almighty. In the variegated myriad 
beauteous forms of nature, he saw the oneness and the manifestation of 
pervasiveness of the universal soul. The flowers, the lush greenery, the 
morning and evening sun, the moonlit night, the starry blue sky, the 
birds, the butterflies, the clouds, the hills and many other phenomena 
thrill him and kindle in him the latent divine passion. The’ nature 
therefore avows as his first teacher. When he came into contact with Sn 
Ramakrishna Mission during his college career, he masters Vedantic 
message. When he approached nature, with this background, 
metaphysical urge and mysterious experience possess him. 


Thus his passionate love of nature becomes a dominant motif in 
his poetry. At dusk time, Sahyadri seems to him to be in yogic trance. 
The winging of cranes in swing technique looks like "God’s Signature" 
to him; sunrise or moonrise are both God’s grace. Seeing the vast 
expanse of green, the poet’s soul turns green. He sees in the clouds the 
purest souls. The sky becomes the art gallery during the cloudy season. 


All the three seers are something like magnets attracting people 
from afar and they do not generally stir out of their places to which they 
are very much attached. Kalyana and Basaveswara are inseparable. 
After Tolstoy settled down in Yasnaya Polyana, it became a part of his 
life. It was a way of life, a support to him and at times a refuge, a focus 
of ancient pieties and his window upon Russia.” 


Whenever he goes to Moscow, he returns immediately. "Without 
Yasnaya Polyana, I find it difficult to imagine Russia and my attitude 
towards her." He cherished the rural simplicity which had governed the 
lives of his forefathers.>. 


2. Tolstoy, Henry Gifford, Page 3. 
3. Ibid., Page 4. 
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Kuvempu also is very much attached to his native place, 
Kuppalli, which perches on the Sahyaddri mountain. Though his 
profe.sion ties him to Mysore physically, his heart is pierced by 
hom. ickness, his spirit remains in his village, often wandering in the 
for:- . His anguish is expressed in a poem captioned J Shall Go which 
con» ms four-lined twenty-six stanzas. Only one stanza has been quoted 
o- — y, for the sake of simplicity. 


I shall go, shall go to my beloved nest 
The land of hills, home of rains, the abode of rich beauty 
Bored with the plains, I shall go to the forest in the hills and valleys 


‘+o the nest of green loveliness, to the sun’s warmth and to the cool shade.’ 


In one of his couplets, Kuvempu expresses thus: "Remaining at 
home and wherever I am, I travel all over the world and heaven. Having 
been married, I remain a great bachelor". 


J. Truth is God 


During his youth, Tolstoy was an addict to smoking, alcohol, 
gambling and sex. "Tolstoy the sensualist became Tolstoy the ascetic" 
after he married and settled down in Yasna@ya Polyana. In this respect, 
Basaveswara and Kuvempu are in total contrast with Tolstoy and even 
from their childhood, they lead pious lives. Tolstoy, who equates truth 
with God, has recorded ail his sins in his diary and has expressed 
himself candidly. He not only castigated his sins but also tried to 
improve the character for the benefit of himself and mankind. 
Kuvempu also maintained diary for about seventeen to eighteen years, 
recording therein whatever happened in his life, good or bad. Here and 
there, he mentions that he has committed a ‘heinous sin’ and entreats 
humbly and sincerely to Sri Ramakrishna Paramahamsa that he may be 
pardoned. To err is human and atonement is divine. Even Basaveswara 
must have committed mistakes. He does not spare himself and 
confesses frankly before Kudala Sangamadeéva that he has committed 
crores of mistakes and supplicated that he may be pardoned. All the 
ii. ze seers believe that Truth is God and practise their beliefs seriously 
< sincerely. | 


4° anslated by KRR, Selected Poems of Kuvempu, Prof. D. Javare Gowda & Dr. 
‘adhan Gurudatta (Editors). 
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K. Hunting 


Being born in the midst of a dense forest, and in the family 
where every member was hunter, the habit of hunting was in Kuvempu’s 
blood. He was a good marksman also. Since he was devoted to studies 
and settled down in Mysore, he used to accompany the huntsmen 
occasionally. For Tolstoy, hunting was a luxurious game and indulged in 
it quite often. 


L. Conclusion 


Basaveswara was not a professional writer. Still, it is really 
astonishing that in the midst of multifarious activities, he was able to 
write Vachanas which are of eternal value and which have no parallel in 
any language of the world. Tolstoy is a prolific, profound writer whose 
works embody the whole experience of the age and inspire and 
influence the writers of the coming ages. He is a novelist par excellence. 
He has written short stories, dramas and essays. Kuvempu’s works are 
also known for both profusion and profundity. He is an extraordinary 
gemus who had cultivated all the literary forms with exceptional insight 
and unusual vision. Another renowned contemporary poet, Bendre, has 
admired him as the "epoch-making world poet’. 


This illustrious trio is called seers, sages, philosophers, 
reformers, revolutionaries and so on and so forth. They earned their 
sainthood through tireless efforts, complete dedication to God and 
service of mankind, philosophical thinking and uninterrupted penance. 
They are the moral conscience of their respective ages. ‘They are the 
greatest of the noblest of the human beings’ and they are the “universal 
men" as conceived by Kuvempu himself. 
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Abstract 


We search for potentially grammatical patterns in the Indus 
writing” based on the concordance of Mahadevan (1977). We make no 
assumptions about its structure or meaning. We only attempt to check if 
the Indus writing is meaningfully structured with specific rules to code 
useful information. In order to avoid possible errors in interpretation due 
to incompletely read or multiple copies of a piece of writing, and other 
possible sources of error, we create an Extended Basic Unique Data Set 
(EBUDS) based on the original electronic concordance of Mahadevan 
(1977). EBUDS consists of completely read single line texts on any side of 
the writing material. We exclude multi-lined text or partially read text. This 
gives us a Set of 1548 lines of data consisting of 7000 signs. We show that 
the ordering of the signs in the wnting is much more Significant than 


1. Address for correspondence: y_nisha@tifr.res.in. 


2. We use the word ‘writing’ in a generalised sense of the word in that it consists of 
information confined to a more durable medium than memory. While we do not 
imply linguistic writing, we do feel that any culture spread over a million square 
kilometres with such uniformity and a high degree of standardisation could not 
have indulged in random scribbling when they marked their writing material. We 
therefore feel that understanding their rules of writing is vital to understanding the 
culture. In this paper we do not comment on whether this is writing with sound or 
vowel writing or proto-writing of simple information. We leave that discussion to a 
later period. Similarly, when we use the words like phrases etc., we imply that a 
phrase is an information-containing unit of one or more signs which do not ap- 
pear without some form of prefixes or suffixes. However, we would like to draw 
attention to the compilation of Mahadevan (2002) on comparison of Indus writing 
with Aryan and Dravidian languages. 
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random association. The unit length of information is 2, 3 or 4 signs at a 
time. We then study the most frequently occurring two, three and four sign 
combinations in EBUDS. We find that in many cases, most common 
two-sign combinations are also parts of most common. three-sign 
combinations, which in turn also appear in four-sign combinations. 
However, in texts with just 2, 3 or 4 signs we do not often find these 
frequent two, three or four sign combinations. We therefore conclude that 
while the information is given in units of two, three or four signs, these are 
more like phrases where an additional sign is required to complete the 
grammatical structure. 


1. Introduction 


Indus writing has been subjected to intense speculation and 
work by a variety of highly capable researchers (for recent reviews see 
Mahadevan, 2002, Subbarayappa, 1997, Parpola, 2005). However, there 
is no clear consensus on either the nature of writing or its possible 
content (for two extreme views see Farmer, Sproat & Witzel, 2004 and 
Fairservis, 1983), though some internally consistent studies do hold 
promise (e.g. Mahadevan, 2007). For a summary of various attempts see 
Possehl (1996). We therefore attempt to understand the writing without 
any presupposition to its content, language or nature. In the present 
study, we attempt to understand the rules of writing. 


| 2. Data set 


We use the concordance of Mahadevan (1977), henceforth 
referred to as M77, as the basic data set on which we use analytical, 
mathematical and computational tools to attempt to understand the 
rules of writing of the Indus script. We use the original electronic data 
set of the same. 


Excluded data 


M77 records 417 unique signs” in 3573 lines of 2906 texts. For 
the present study, we remove those texts that can have potentially 
ambiguous reading. We create an Extended Basic Unique Data Set 
(EBUDS). It removes all texts containing lost, damaged or illegible 
passages marked by the sign and doubtfully read signs marked by 


3. The serial number of the signs used in this paper is similar to that given by 
Mahadevan in his concordance (1977). 
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asterisk. All texts from multi-lined sides are also removed. This is to 
avoid possible overflow in writing of information from one line of text to 
next. However, we assume that in multisided writing, the text on each 
side is independent of text on other side(s)’. It retains texts from those 
sides of multisided objects which have only one line of text. Texts 
appearing more than once are taken only once. 


The unit of textual analysis for the study of distributional 
statistics is a line of text. There are two reasons why it is not possible to 
consider the whole text on a single side as a unit for this purpose. 
Firstly, there is no way of knowing beforehand whether different lines of 
an inscription appearing on the same object or even on the same side 
have continuity of sequence or to be regarded as separate texts. 
Secondly, it is not possible to ascertain beforehand the real order (if 
any) of the lines of text appearing on the same side (Mahadevan, 1977, 


p. 10). 
Statistics 


Based on this we read the texts and we get the following 
- statistics. 


Number of texts occurring only once 1303 
Number of texts occurring more than once 245 


Therefore, Total number of independent texts 1548 


In EBUDS, 40 signs out of 417 present in the Sign List of 
Mahadevan do not make their appearance. Out of these removed 40 
signs, one sign (sign number 374) appears 9 times, one sign (sign 
number: 237) appears 8 times, two signs (sign numbers: 282, 390) 
appear 3 times, three signs (sign numbers: 324, 376, 378) appear twice 
and thirty-three signs appear only once in M77. 


Rules of reading the strings of script 


We make no assumption about the content or meaning of Indus 
writing. M77 shows that the script is written right to left though this has 


4. An earlier Basic Unique Data Set (BUDS) did not consider multisided objects. 
However, this significantly affected the data set available for analysis. The 
analysis reported here was also done on BUDS which gave identical results with 
reduced statistical significance. These results are given in Yadav and Vahia 
(2007). 
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no effect on our analysis. However, as a convention followed in the 
present paper, the texts depicted by pictures are to be read from right 
to left, whereas the texts represented by just strings of sign numbers are 
to be read from left to right. As our first emphasis is to attempt to write 
in the script rather than read we search for rules of writing without 
assigning meanings or interpretations. In the present work we only 
attempt to derive the grammar of writing in terms of search for patterns 
of sequencing of writing. We make no attempt to assign meanings to 
any of the signs. 


We do not take into account the variation due to archaeological 
context of sites, stratigraphy and the type of objects on which the texts 
are inscribed. 


3.1 Analysis 1: Preliminary analysis> 


We studied the frequency distribution of the signs and sign pairs 
to check the consistency of our database with that of Mahadevan 
(M77). Frequency distribution of the signs is given in Table 1. 


The comparative statistics of sign occurrences is also given in 
Table 1. In the first 4 columns we reproduce the data from table 11.10 
of M77. In EBUDS, the total sign occurrences are small and so they 
cannot be compared directly with M77. Hence, we study the frequency 
of n -signs where n is defined by column 2 of table 1. It should be noted 
that the signs that make up the frequent set are exactly the same in M77 
and EBUDS for the signs that form the first 4 rows. In columns 5 to 7 
we show that the most frequent sign in our data set accounts for 10.21% 
of our data set compared to 10.43% in M77. Similarly, the second most 
frequent sign accounts for 5.39% of our data set compared to 4.85% in 
M77 and so on. Hence, comparing data from column 4 and 7, we see 
that that the frequency distribution of common signs in both the data 
sets are comparable and hence our reduced data set is a true 
representation of the data of M77, with a reduction in total sign 
occurrences, but not in the percentage of total sign occurrences. In 
table 2 we perform a similar test for sign pairs. The data of M77 is taken 
from table 11.12 of the concordance for sign pairs. 


5. The analysis is done using computer programs written in PERL (Practical Extrac- 
tion and. Report Language) and shell script to extract required information from 
the digitized Indus data. The results were analyzed using Excel software. 
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Table 1: Frequency distribution of Signs 


Frequency In M77 Sa “= EBUDS 


range in 
M77 


> 1000 
999-500 
499-100 
99-50 


49-10 
9-2 


[NOS = Number of signs; TSO = Total sign occurrences; 
TSO% = Total sign occurrences (in percent)] 


Table 2: Frequency distribution of Sign pairs 
Frequency 
range in 
M77 


[NOC = Number of pair-wise combinations; TF = Total frequency; 
TF% = Total frequency (in percent)] 
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Conclusion from Analysis 1 


The frequency distribution of the signs and sign pairs is 
consistent with M77. The result of our analysis shows that the manner 
of choosing the data set has not changed the pattern of occurrence of 
various signs in the data set and the results are consistent with the 
analysis of M77. We therefore proceed to the next step of analysis. 


3.2 Analysis 2: Check against random order 


In order to ascertain whether the pairing of signs in the Indus 
inscriptions or texts is significant or not, we adopt a statistical 
procedure. The 377 signs studied here appear in 1548 texts, giving a 
total sign set of 7000 signs. We take a random number generator and 
reshuffle them such that the total number of occurrences of each sign is 
identical to the genuine Indus data set (EBUDS). We then break this 
string of 7000 signs in such a way that they mimic the genuine Indus 
data set in terms of number of text lines with a specific number of signs. 
Hence, this reshuffling provides us with a randomised data set for Indus 
writing. In order to ensure that the relative frequency of occurrence of 
different signs is the same in randomised data set, we simply randomise 
the sequences rather than create an artificial set of texts. This method 
therefore gives an accurate comparative randomised data set to check 
the units of information in the genuine Indus data set without changing 
the occurrence rate of different signs. We make 10 such random data 
sets and compare the frequency of most frequent combinations® of 
two-signs, three-signs, four-signs etc. in the randomised data set with 
the genuine Indus data set. If the Indus writing was simple pattern 
writing without information content, the sequence of occurrence of 
these signs would have a distribution comparable to random 
occurrence of different signs in the neighbourhood of another sign and 
the frequency of sign combinations of two-signs, three-signs etc. would 
be decided only by the statistics of absolute frequency of occurrence of 
a sign. The frequency with which a two or three sign combination is 
found in EBUDS data compared to random data indicates the level of 
significance of the writing. The observations are given in table 3 and 
figure 1. 


6. We do not look at the information content of singlets as complete unit of informa- 
tion. This is because by its very nature EBUDS has unique data set. Hence if a 
singlet comes more than once, its subsequent appearances will not be seen by 
the data set. Also, the nature of randomisation applied here does not permit us to 
test for the possibility whether singlets contain complete information. 
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Table 3: Result of Analysis 2 


FSC 


Random Data set 


58°56" 458.35 
a. ee a 
1.3 
1 1 1 1.0 
1.0 


[NOS = Number of signs in the sign combination; FSC = Frequency 
of most frequent sign combination; M = Mean; EB = EBUDS] 


NOS 


66 


Table 3 gives the result of Analysis 2. Here, the first column gives 
the number of signs in the sign combination. It is 2 for combination of 
two-signs, 3 for combination of three-signs etc. Columns 2 to 11 give the 
frequency of most frequently occurring sign combination of 2, 3, 4, 5 
and 6 signs for the 10 random data sets. Thus, for the first random data 
set, the frequency of most frequently occurring two-sign combination is 


Most Frequent Sign combination Frequency vs No. of signs in 
the combination 


180 ——_— $$$ 


ALT St t-te 


160 
140 
120 
100 


Frequency of most frequent 
combination 


No. of signs in the combination 


EE! RTE AP RT He ee 
ee Random Datasets (Mean) —e— Genuine Indus Dataset i 


Fig. 1: A plot of the frequency of most frequently occurring sign 
combination of lengths 2, 3, 4, 5 and 6 versus the number of signs in the 
combination for genuine Indus database (bold line) and ten random 
databases (dashed line). By Genuine Indus data set we mean EBUDS. 
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60, the frequency of most frequently occurring three-sign combination 
is 5, the frequency of most frequently occurring four-sign combination 
is 1, the frequency of most frequently occurring five-sign combination 1s 
1 and the frequency of most frequently occurring six-sign combination 
is 1. Similarly, for second random data set, the frequency of most 
frequently occurring two-sign combination is 54, the frequency of most 
frequently occurring three-sign combination is 3, the frequency of most 
frequently occurring four-sign combination is 1, the frequency of most 
frequently occurring five-sign combination is 1 and the frequency of 
most frequently occurring six-sign combination is 1 and so on for other 
random data sets. Column 12 gives the mean of the frequencies of the 
most frequently occurring sign combinations of lengths 2, 3, 4, 5 and 6 
of all the ten random data sets and the last column gives the frequencies 
of most frequently occurring sign combination of lengths 2, 3, 4, 5 and 6 
for genuine Indus data set for comparison. 


Conclusion from Analysis 2 


The analysis shows that there is a significant difference in the 
frequency of the most frequent sign combinations of different sizes 
obtained from the random data sets in comparison to that of EBUDS. 
The frequency of occurrence of the sign combinations of length two, 
three and four, in case of the ten random data sets, is considerably low 
in comparison to that obtained in case of EBUDS. Hence, it is 
justifiable to state that Indus texts followed certain rules and thereby 
meant something significant and meaningful. 


3.3 Analysis 3: Positional analysis of most frequent two-sign, 
three-sign, four-sign combinations 


In view of the importance of two-sign, three-sign and four-sign 
combinations from Analysis 2, we do a positional analysis of most 
frequent two-sign, three-sign, four-sign combinations present in 
EBUDS. We check whether these most frequent two-sign, three-sign, 
four-sign combinations tend to appear at fixed positions (i.e. left-end, 
middle or right-end) of the texts or they tend to appear as independent 
(or solo) texts. The positional analysis results for two-sign, three-sign 
and four-sign combinations are tabulated in tables 4, 5S and 6 
respectively. 


Table 4 gives the positional distribution of most frequent 
two-sign combinations (frequency of 25 or more). Several points 
immediately become clear. 
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Table 4: Positional Analysis of Two-sign Combinations 


| Two-sign Combination | Frequency Solo (%) | Left(%) | Middie (%) | Right (%) 
a ee 168 0.60 1.79 11.90 | 95.71 
76| 000 om0 | oan] to8r 
g9| ooo | 6st] 39 | 000 | 
3k” a ee" 68 | 1.72 7241| 25.86 | 0.00 
~~~ @ 66 0.00 | 000; 893] 91.07 
3a! 9 0.00 
R14] 9 20 
yas asa |) 00] 000 | 00 | 0000 | 
Qo} ii a a ) e ) 

Sea a ae] ea | a 
i «| 7 36 | 0.00 o0o| 80.66 | 19.44 | 

Y 12] G29] 3s} oo 0.00 
TOR aS A a ES ee ee 5 ia 
B2as|  as| ss] ooo | oss | tz | 18.18 | 

| & se] am ooo soz | | 
7 a a YS a) ed ee 
0 a, a ee ee | rast | 2222 | 
ae Tee ee ee 


1) All the two-sign combinations have a preferred location of . 
occurrence (i.e. left-end, middle or right-end). 


2) Their percentage of occurrence as independent texts (i.e. solo) is 


very small. 


3) The most frequent two-sign combination (267, 99) whose total 
frequency of occurrence is 168 appears most frequently at the 
right-end of the texts, its percentage of occurrence at the right-end 
of texts being 85.71. Similarly, the two-sign combination (336, 89) 
whose total frequency of occurrence is 75, appears most frequently 
in the middle of the texts, its percentage of occurrence in the middle 


of texts being 89.33. 


Table 5 gives the positional distribution of most frequent 
three-sign combinations (frequency 10 or more). The following points 


arise from this table: 


1) The three-sign combinations tend to occur in specific positions im 
the text (i.e. left-end, middle or right-end). 
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Table 5: Positional Analysis of Three-sign Combinations 


Three-sign Combination Frequency | Solo (%) | Left(%) | Middle (%) Right (%) | 


- 1 88.24 | 2.94 | 
U : 00 | 0.00 | 
| 1 saz! *% te2| G& 249 24 | AAT | 83.33 | 8.33 | AAT 
| Ub sai Y 109 | & 249 20|  — §.00 70.00 20.00 | 5.00 
aS ina * | oe 
| UF 342 ,.- + 171 19 6.26 73.68 5.26 | 15.79 | 
x 149 19 0.00 0.00 | 78.96 21.05 
ise 0.00| 100.00 0.00 | 
| © aos | | 81.25 | 6.28 6.25 | 
f 130) 16; 0.00 75.00 | 25.00 | 0.00 
; 4 | | | 


67 99; % 267) 14 0.00 | 0.00 : 7.14 | 92.86 | 


2) Their percentage of occurrence as independent texts (i.e. solo) 1s 
very small. 


3) The most frequent three-sign combination (336, 89, 211) whose total 
frequency of occurrence is 34 appears most frequently at the 
left-end of the texts, its percentage of occurrence at the left-end of 
texts being 88.24. Similarly, the three-sign combination (293, 123, 
343) whose total frequency of occurrence is 25 always appears at the 
right-end of the texts. 


4) Some of the important three-sign combinations arise as additive 
forms of frequent two-sign combinations listed in table 4. For 
example, the three-sign combination (336, 89, 211) arises from the 
two-sign combinations (336, 89) and (89, 211). However, this 
three-sign combination appears 34 times, as does the two-sign 
combination (89, 211), while the second two-sign combination (336, 
89) appears a total of 75 times, out of which it appears in this 
three-sign combination only 34 times. 


5) The second most frequent three-sign combination (293, 123, 343) 
consists of two-sign combinations (293, 123) and (123, 343). In this 
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case also the frequency of occurrence of the three-sign combination 
(293, 123, 343) and the two-sign combination (123, 343) is identical 
with 25 occurrences. 


Table 6: Positional Analysis of Four-sign Combinations 


Four: «sign Combination Frequency | Solo Solo (%h) | Left (%) | | Left (%) | Middle (%) - (%) Middle (%) Right (%) | (%) | 


6.25 | 68.76 625] +6878] an besdcn __ 8 75 | 
0.00 | 0.00 33.33 | 66.67 | 
0.00 0.00 | a333| 1667 | 
0.00 100.00 | “000| +000 | 
20.00 80.00 0.00 0.00 | 
20.00 2000, 2000) 40.00 
0.00 80.00 . 000;' 2000) 


Table 6 gives the positional distribution of most frequent 
four-sign combinations. The following points are evident. 


1) The four-sign combinations tend to occur in specific positions in the 
text (i.e. left-end, middle or right-end). 


2) Their percentage of occurrence as independent texts (i.e. solo) is 
very small. 


3) The most frequent four-sign combination (51, 130, 149, 342) consists 
of very frequent three-sign combinations (51, 130, 149) and (130, 
149, 342). This four-sign combination has the same frequency of 
occurrence as the three-sign combination (130, 149, 342). 


Discussion on Tables 4, S and 6 


We discuss the combined importance of tables 4, 5 and 6. In 
table 4, the two-sign combinations (336, 89), (87, 59), (171, 59), (249, 
162), (65, 67), (51, 130), (99, 67) and (123, 343) never appear solo and 
even within large texts they appear in the middle. This is in contrast to 
the two-sign combinations (267, 99), (391, 99) and (293, 123) which 
mostly appear at the right extreme of the texts. A third group of 
two-sign combinations (342, 176), (8, 342), (347, 342), (342, 1), (89, 
211), (59, 211) and (162, 342) mostly appear at the left extreme of texts. 
The two-sign combinations (48, yim and (245, 245) seem to be evenly 
distributed. 


The two-sign combinations that appear at extremities clearly 
form whole or part of text-starting or text-ending sign combinations. 
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We now look at the sign combinations that appear in the middle. The 
two-sign combination (336, 89) when combined with the sign 211 on the 
right comes solo 2.94% of the times or at left end 88.24% of the times 
(Table 5) indicating that if not complete text, it is a very decisive text 
ender sign combination. 


The two-sign combination (87, 59) when combined with the sign 
99 at the left appears always in the middle (Table 5). However, when 
the three-sign combination (99, 87, 59) is combined with 267 at the left, 
it mostly appears at the right end of texts indicating that this four-sign 
combination is a text beginner sign combination. It is interesting that 
another four-sign combination not listed in table 6 occurs 4 times and 
differs from this only by the rightmost sign where 267 is replaced by 391. 
This suggests that 267 has a meaning similar to 391. 


The two-sign combination (171, 59) does not appear with high ' 
frequency in’ three-sign combination table (Table 5). Even when 
appearing in a four sign combination, this pair (171, 59) appears on the 
left of another significant two sign combination (336, 89) as (171, 59, 
336, 89). 


The two-sign combination (249, 162) when combined with 342 at 
the right comes solo 4.17% of the times and at the left extreme of the 
texts 83.33% of the times. It is interesting that the sign 169 shows similar 
behaviour to the sign 162. 


Conclusions from Analysis 3 


1) The most frequent two-sign, three-sign and four-sign combinations 
appear at fixed positions. 


2) The exact location varies from combination to combination. 


3) However, frequently occurring two-sign, three-sign and four-sign 
combinations may be incomplete except of course when they occur 
as solo texts. It can be seen that two-sign, three-sign and four-sign 
combinations which are complete have typically one of the 
text-enders (mostly 342 or 211) at the end. This is confirmed by the 
solo occurrences of such texts. 


3.4 Analysis 4: Comparison with other scripts 


Naranan and Balasubrahmanyan (1992) have used the statistics 
of word frequencies in different languages. Quoting the work of Zipf 
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(1935, 1949) they analyse the word frequency distribution and use the 
power law relation between W(k) and k, where W(k) is the number of 
different words or word types occurring k times in a discourse of words. 
They assume the relation W(k) = Ck”. Naranan and Balasubrahmanyan 
have given the graph of W(k) vs. k for different languages and writing 
styles and compared it with the Indus style. In figure 2 we have plotted 
the value of -y for different languages and authors and along with the 
value for the Indus writing (last point in figure 2). As can be seen from 
the figure, the slope for Indus writing is significantly different from that 
for other known writing styles. 


Word Frequency Distribution for different Languages 


=> 
-) 
= 7 
a 
3 
Cc 
eo - 
- 
ie 
vo 
— 
= 


Eng 1 Eng2 Shakes. Latin Chinese Joyce Russian Indus 


Language 


Fig. 2: Word frequency distribution of different languages (based on 
Naranan and Balasubrahmanyan, 1992). 


Conclusions from Analysis 4 


1. Unlike a pictographic script such as Chinese writing where each sign 
stands alone in meaning, the Indus script is not a simple 
incrementally built up script. Indus script is also not alphabetic. This 
is also clear from the fact that Chinese language has more than 3000 
different signs compared with 417 signs in Indus script and 26 in 
English. However, some signs do seem to be of pictorial nature. 


2. Study of sign combinations may be more meaningful than a study of 
individual signs. 
4.0 Summary 


We therefore make the following conclusions about the Indus 
script. 


1) The writing is highly ordered. 
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2) Typical length of information containing units is 2, 3 or maximum 4 
signs. However, they are not always complete enough to exist as 
standalone piece of text. This also indicates a more complex 
grammar in the writing where information units need proper 
beginners or enders (see tables 4-6). 
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Abstract 


We adopt a comprehensive approach to segment the Indus texts 
using Statistically significant signs and their combinations in addition to 
all the texts of length 2, 3 and 4 signs. We find that we can segment 88% of 
Indus texts (of length 5 and above) by this method and hence it can be 
suggested that the texts of 5 or more signs can actually be seen as 
permutations of other frequent sign-combinations or smaller texts (of 
length 2, 3 or 4 signs). The results of the segmentation process are in 
agreement with our earlier results (Yadav et. al, 2008, henceforth referred 
to as Paper 1) where we show the importance of 2, 3 and 4 sign 
combinations as important units of information. We do not assume 
anything regarding the content of the script and the work is purely based on 
the structural analysis of Indus Texts. 


1.0 Dataset: We use electronic concordance of Mahadevan (1977), 
henceforth referred to as M77 (For details see Paper 1). M77 records 
417 unique signs” in 3573 lines of 2906 texts. We remove texts that can 
have potentially ambiguous reading. We create an Extended Basic 
Unique Data Set (EBUDS) by removing all texts containing lost, 
damaged or illegible passages marked by diagonal lines and doubtfully 
read signs marked by asterisk. All texts from, multi-lined sides are also 


1. Address for correspondence: y_nisha@tifr.res.in. 


2. The serial number of the signs used in this paper is as given by Mahadevan in his 

_ concordance (1977). As a convention followed in the present paper, the texts 
depicted by pictures are to be read from right to left, whereas the texts repre- 
sented by just strings of sign numbers are to be read from left to right. — 
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removed. However, we assume that in objects where writing is found on 
several sides, the text on each side is independent of text on other 
side(s). We retain texts from those sides of multisided objects that have 
only one line of text. Texts appearing more than once are taken only 
once. We do not take into account the variation due to archaeological 
context of sites, stratigraphy and the type of objects on which the texts 
are inscribed. 


The unit of textual analysis for the study of distributional 
statistics is a line of text. There are two reasons why it is not possible to 
consider the whole text on a single side as a unit for this purpose. 
Firstly, there is no way of knowing beforehand whether different lines of 
an inscription appearing on the same object or even on the same side 
have continuity of sequence or to be regarded as separate texts. 
Secondly, it is not possible to ascertain beforehand the real order (if 
any) of the lines of text appearing on the same side (Mahadevan, 1977, 


p. 10). 


EBUDS contains 1548 texts. In EBUDS, 40 signs out of 417 
present in the Sign List of Mahadevan do not make their appearance. 
Out of these removed 40 signs, one sign (sign number: 374) appears 9 
times, one sign (sign number: 237) appears 8 times, two signs (sign 
numbers: 282, 390) appear 3 times, three signs (sign numbers: 324, 376, 
378) appear twice and thirty-three signs appear only once in M77. 
Hence all these 40 signs are rarely occurring signs and their absence in 
EBUDS does not significantly alter the patterns of writing. 


2.0 Segmentation Approach 


The Indus texts can be segmented by any of the following 
methods. 


a) Comparing two texts?: Two texts that are identical except for a few 
signs at the beginning or end can be compared and it can help us 
extract the segments (Mahadevan, 1978). 


b) Using frequent combinations of signs’: There are some frequent 
combinations of two-signs, three-signs etc. which can be treated as 


3. The term "text" implies complete line of text of Indus signs and EBUDS consists 
of 1548 such line of texts:with variable lengths (1 to 14 signs). 


_4."Frequent combination of signs" is a combination of Indus signs present 
- anywhere in the text. They are characterised by their frequent rate of occurrence 
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segments or identifiable units merely by their frequent rate of 
occurrence (Mahadevan, 1978). In Paper 1 we had shown that their 
frequency is far greater than would be expected by random chance. 


c) Using sign-pair frequencies: The strongest and weakest junction 
points in a text based on the frequency of adjacent sign-pairs can be 
used for segmentation (Mahadevan, 1978). 


d) Using Single Signs: Single signs falling in the categories of frequent 
beginners, frequent enders, and frequent auxiliary enders can be 
used to segment these texts. 


All these methods are cumulative and overlapping. Hence, it 
becomes critical to decide which method should be given priority over 
others for the process of segmentation so that we end up with 
meaningful segments. 


We adopt a step-by-step approach to segment the Indus texts of 
3 or more signs. We have used statistically significant units 
(combination of signs or single signs) in addition to all texts of length 2, 
3 and 4 for the process of segmentation. The following section discusses 
the various segmentation units in detail. 


3.0 Segmentation Units 


Segmentation units are defined as the texts (of 2, 3 or 4 signs) 
and other statistically significant units used for segmentation of Indus 
texts. The segmentation units are: 


1) Two-sign, Three-sign and Four-sign Texts (Table 1) 
2) Frequent sign combinations of 2, 3 and 4 signs (Tables 2-11) 


3) Single Signs: Text Beginners, Text Enders and as Auxiliary Text 
Enders (Tables 12-14) 


Each of these units is explained now in detail. 


in distinct Indus texts. They can be viewed as part of a complete Indus text but 
sometimes that combination does appear as a complete Indus text. One ex- 
ample of such frequent sign combination is "267, 99" occurring 168 times in the 
complete corpus of EBUDS. It appears as an independent text once in EBUDS. 
Another example of such frequent sign combination is the sequence "336, 89, 
211". 
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3.1 Two-sign, Three-sign and Four-sign Texts 


The two-sign, three-sign and four-sign texts that appear as 
complete texts in EBUDS form the first set of segmentation units. 
Table 1 gives the number of texts of various lengths (in terms of number 
of signs) in EBUDS. | 


Table 1: Number of texts of lengths 1 to 14 in EBUDS 


Number of Texts (EBUDS) 
69 


Number of Signs in the Text 


ee 


Oo OA NHN Ff WH WN 


pa 
—) 


baed 
bd. 


ray 
No 


a 
> WD 


As can be seen from table 1 EBUDS has 189 texts of length 2 
(P1 to P189), 284 texts of length 3 (T1 to T284) and 263 texts of length 
four (Q1 to Q263). 


3.2 Frequent Sign Combinations of 2, 3 and 4 signs (Beginner, Ender 
and Middle) 


Frequent sign combinations of 2, 3 and 4 signs that appear 
predominantly (= 50% of times) at beginning, ending or middle 
positions in Indus Texts (Tables 3-11) form the second set of 
segmentation units. 
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Table 2 
Selection Criteria of 2-, 3- and 4-sign combinations 
used as segmentation units 


Number Sign-Combination 
Two-sign 


Three-Sign 


Four-sign 


[MF = Maximum Frequency; TFC = Total Frequency Cut-off] 


* The cut-off for total frequency of occurrence is selected by 
taking into consideration frequency of occurrence of most 
frequently occurring combination in the respective 
category. The beginner, middle and ender combinations of 
4, 3 and 2 signs are given in tables 3-11 respectively. These 
were used for the segmentation of the texts already 
segmented using two-sign, three-sign and four-sign texts 
(section 3.1). 


Table 3: Beginner Four-sign Combinations 


Si. | d Beginner Four-sign combination Total | Percentage of _ Marker | 
No. | «Frequency of | occurrence as 
Sign No. _ Pictorial version occurrence | beginner | | 


| 
! 


Middle Four-sign Combination 
Sign No. 
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Table 5: Ender Four-sign Combinations | 


Ender Four-sign Combination Total | Percentage of Marker | 
Frequency of | occurrence as 
wea lt Pictorial version | "gceurrance | ender 
178 PGA | QeE1 
8 tf Bee t% 8s ' 
Bar Ff IP IO £ i 
ac | 6 0, San a | 100 QE3 
| 34° Wome $2 j 
re fsa] via’ : Qed 
| 6 89 HW se Qe 
27186 330 % © ' 
| : . 
6 | 70 89 QE6 | 
! 
; ; empl j 
| 7. , i 89 : ht QE7 / 
| | 241 66 23690 ! 
| 8 | 72 336 | e9 | 211 ges | 
| l oe Ae 
9 | 175 | 230; 53 = 342 Wary § 80 QE9 
| 10 13396! e9 | 216 | 256 | XH 4 | 75 | Qe10 
i tts Dts OF 2h ) 
; 14 | 65 | QE11 
| 
’ ! 
| 42 | 61 | QE12 | 
eth ee ae QE13 
! | 
4°" or | Qe14 
j 


Total Percentage of | Marker 


ee eRe ' Frequency of | occurrence as |; 
| | Sign No. Pictorial version b> culdnide beginner | 
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Table 7: Middle Three-sign Combinations 


Si. , Middle Three-sign Combination Total | Percentage of Marker 
NO. ean NA 1} Dietasial Uaraian tT rrequency.of | occurrence et 

: Sign No. | Pictorial version occurrence | middle 

’ / j ; 

| 4} 99 67 59° Wall | 16 | 100 ™1 | 


| 3 | 51 130 149 | xem | 19 79 TM3 
/ : tee 1 4! 

; ! ; ( —/ . 

; 4 } 8 171 59 | vr oOWANW ) 10 60 TM4 


Table 8: Ender Three-sign Combinations 


/ 


SI. Ender Three-sign Combination Total Percentage of | Marker 
i a RS. MA ee 
Sign No. Pictorial version PE gia Reactor ! 


25 | 245 | 245 
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Table 9: Beginner Two-sign Combinations 


Si. | Beginner Two-sign Combination | Total Percentage of = Marker 
No. [-. _ ;|.,.....,.) Frequency of | occurrence as 
Sign No. Pictorial version acenmentll a iaihae | 


BS 
cree f 2 100 PN 
OS Se ee 
es cen 
a a SN 2 100 PMS 
a wre paella ani = 
= 
6 NS 16 Fs. x 9c PMS 
wm Ne = 
rj 33. 2 lin S SS Pu? 
& ix 
8 | 20 | 162 Yd | cI 8s PMS 


s | a! 68 as 3¢ arts) 
. SS **> a 
10} 8? | 6 | rs Peto 
ee | ee 
\ x f 
12 | 171 : ; . 21 ; oS Putt 
ee 
TT 
: { > Nas ‘ 
2) $7 ae a. \ TS _ Pet? 
= , : 
" Yrs 4 
i] wo ! o ee 2? . t4 a 
aes Ss Se ee ee ee Pe -. 
Fr >| ‘Xo é 27 70 Petia 
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Table 11: Ender Two-sign Combinations 


| Si. Ender Two-sign Combination | TotalFrequency | Percentage of 
No. Sign No. Pikes Veraion of occurrence | poeuyasies as 


be | 342 | 176 EU 69 | 97 


3.3 Using Single Signs (Beginner, Ender and Middle) 


Marker | 


PE' 


Text Enders, Text Beginners and Auxiliary Text Enders form the 
third set of segmentation units. Based on the percentage of occurrence 
at the beginning, middle or end of texts, we categorise the most 
frequent signs as Text Enders, Text Beginners and Auxiliary Text 


Enders. Each of these is explained below. 


i) Text Beginners: Text Beginners are defined as signs appearing 
predominantly (= 50% of times) at the beginning of texts (Table 


13). 


li) Text Enders: Text Enders are defined as signs appearing 
predominantly (>= 50% of times) at the end of texts (Table 12). 
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it) Auxiliary Text Enders: Auxiliary Text Enders are defined as signs 
appearing predominantly (= 50% of times) at the middle of texts 
(Table 14), generally preceded by Text Beginners.. 


These are listed in tables 12 to 14. 


Table 12: Text Ender Signs 


Total Frequency | Percentage of | Marker 


of occurrence occurrence as 


Sl. Ender Sign 


“ Sign No. | Pictorial version 


ote | @ | « {= 


Beginner Sign 


Pictorial version 
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Table 14: Auxiliary Text Ender (Middle) Signs 
Total Frequency | Percentage of 


of occurrence | occurrence at 
middle 


Auxiliary Ender Sign 


Sign No. | Pictorial version 


4.0 Method employed in segmenting Indus texts 


We focus on segmenting the 734 texts of 5 or more signs to see if 
they are composites made of smaller information units. The steps 
followed in the segmentation process are explained in Figure 1. 


’ 


Step 1 
Search for two-sign, three-sign and four-sign texts successively 


Step 2 
Search for frequent four, three and two-sign combinations 
successively 


Step 3 
Search for text enders, text beginners and auxiliary text enders 
successively 


Figure 1: Steps for Segmentation of an Indus Text 
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Step 1: Search for two-sign, three-sign and four-sign texts successively 


We start with 189 two-sign texts as basic segments and search the 
whole dataset of 1290 texts (with 3 or more signs only) for these basic 
segments, marking them using different markers wherever found. 


This is followed by similar search for 3 and 4 sign texts 
respectively on the resultant dataset (dataset which had been searched 
for two-sign texts). We give importance to smaller texts (here two-sign 
texts) over three and four sign texts because a larger text could be a 
combination of one or more smaller units and the independent 
occurrence of the smaller unit increases the probability of smaller unit 
being a unit of information. The segmentation process is executed as 
follows: 


@ We take all stand-alone texts of length 2, 3 and 4 as complete units 
of information. 


e@ For this analysis, we do not take single signs which appear solo. 
There are 69 signs in EBUDS that appear solo and they may 
artificially split grammatically significant units of information. We 
know that there are several cases where a given sign appears solo a 
few times, but appears with a specific other sign far more frequently 
indicating that its two-signed appearance carries far - greater 
significance. Hence as an approximation, we begin with texts of 
length 2 or more. 


@ We segment larger texts using the two-sign, three-sign and four-sign 
texts successively. 


@ We split first with two-sign texts which represent smallest bits of 
information. At the end of this step 45% of texts (of length 5 and 
above) remain unsplit. 


Step 2: Search for frequent four, three and two-sign combinations 
successively 


The resultant dataset (from step 1) is then segmented using 
frequent 4, 3 and 2 sign combinations successively. These are listed in 
tables 3-11. The segmentation process is executed as follows: | 


e Inthis step, we search for frequent sign combinations. 
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@ Since these are not found stand-alone very often, they may or may 
not be complete. However, irrespective of whether they are 
completely stand-alone or not, they do represent identifiable units 
of information which can be islanded from its neighbourhood of 
signs. We therefore search for such frequent sign combinations in 
the resultant data set (from step 1). 


@ Unlike step 1, we reverse the order while searching for frequent sign 
combinations as four, three and two successively, since a four-sign 
frequent combination is more likely to be a significant unit than a 
two-sign frequent combination. 


e At the end of this step 23% of texts or segments (of length 5 and 
above) remain unsplit. 


Step 3: Search for Enders, Beginners and Auxiliary Enders successively 


The Indus texts after undergoing segmentation using 2, 3 and 4 
sign texts (step 1) and then by frequent sign combinations (step 2) are 
subjected to further segmentation using statistically significant Text 
Ender, Text Beginner and Auxiliary Text Ender (Middle) signs. 


e@ Incase a text or segment of 5S or more signs is not segmented by step 
1 and step 2, we try segmenting the same based on frequently found 
text beginners or text enders. 


e@ At the end of this step 17% of texts or segments (of length 5 and 
above) remain unsplit. 


@ We then use ‘auxiliary’ text enders that commonly appear just after 
the standard text beginners, for segmentation, and at the end of this 
step 12% of texts or segments (of length 5 and above) remain 
unsphit. 


The complete procedure results in splitting 88% of the texts (of 
length 5 and above) in EBUDS. The results are tabulated in table 15 
(Figure 2). 


5.0 Results 


In table 16, we list out the number of segments of various lengths 
after segmentation. The length vs. frequency of texts or segments is 
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Step 1 
Search for two-sign, three-sign and 
four-sign texts successively 


55% of 734 texts split 
Step 2 


Search for frequent four, three and 
two-sign combinations successively 


77% of 734 texts split 
Step 3 


Search for Enders, Beginners and 
Auxiliary Enders successively 


88% of 734 texts split 


Text Segments 


Figure 2. Results of Segmentation Process 


given in Figure 4. EBUDS before and after segmentation is given in 
Figure 5. 


It must be noted that if these units 1.e. 2, 3 and 4 sign texts do not 
significantly contribute to the process of segmentation of larger Indus 
texts then considermg them as segmentation units becomes 
questionable. However, finding them as part of larger Indus texts in a 
frequent manner justifies the nature of these 2, 3 or 4 sign texts as 
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Table 15: Results of han ea starting with 734° texts 


aan ae i 


NSSAR INSSAR SP USP | USP 
Texts of length 2, 3, 4 


Frequent Combination of length 
4,3, 2 (Beginners and Enders only 


Frequent combination of length 
4,3, 2 (Middle) 


By Text Enders 


By Text Beginners 
By Auxiliary Text Enders 


[N = Number; NSSA = Number of segments of length 5 and 
above; NSSAR = Number of segments of length 5 and above 
remaining unsplit; SP = Split % of 734 texts taken for 
segmentation; USP = Unsplit % of 734 texts taken for 
segmentation] 


* There are 734 texts of length 5 and above in EBUDS. 


Table 16 
Number of texts of Lengths 1 to 14in EBUDS 
before and after segmentation 


Number of Signs Number of Texts (EBUDS) 


1 
2 
3 
4 
" 
6 
7 
8 
9 
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Number of Texts or Segments 


—e— EBUDS before, _ 
segmentation 


a + ee 


ane aca ee EIT IIS, N ceel 
segmentation | 


B00 + 
600 -—=———__—_ 


400 - 


1 2 3 “ 5 6 7 8 9 Sime | 12 13 | 14 


Text or Segment length 


Figure 4 
Segment Length vs. Segment Frequency in EBUDS 
before and after segmentation 


consciously written important pieces of information. Table 18 lists most 
frequent segments (Texts or Frequent sign combinations occurring 1n 


EBUDS after segmentation). 


Table 19 lists few examples of Indus Texts segmented using this 
method. The number in the first column is the object number (in M77) 


ijdl Vol. XXXVII No. 1 69 Nisha Y., Vahia, Mahadevan, Joglekar 


EBUDS before segmentation 


n 


EBUDS after segmentation 


eT 


RSfago0ornnneos 
-_ oe weoc (ak th mt tt A) 
oO 


Figure 5: EBUDS before and after segmentation 


of the complete text. The number at the bottom of each smaller 
collection of signs is the object number (in M77) on which that segment 


appears. 
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Table 18 


i Sl. | Frequent segments in EBUDS after segmentation ' Total Frequency | 


of occurrence 


6.0 Conclusion 


It is possible to segment 88% of Indus Texts into segments of 
length 4 and below by using statistically significant signs and their 
combinations in addition to all the texts of length 2, 3 and 4. Based on 
the analysis of the segments obtained as a result of the above 
segmentation process we draw the following conclusions: Bons 


1) Many frequent sign combinations make their appearance as. : 
independent texts and hence considering these frequent ‘sign 
combinations as units of information 1 is justified for segmenting these 
texts. rs 
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Table 19: Few Examples of Segmentation 


Pear are geen EEE Ree ns 
| Object No. Segments of Text / 
; : | 


2) The frequent sign-combinations which appear as independent texts 
are those that most often occur at the beginning or end of Indus 
Texts. The frequency of occurrence of a frequent sign combination 
which often comes at the middle of Indus text, as an independent 
text, is quite low. 


3) The graph of segment length vs. segment frequency (Figure 4) again 
shows the importance of 2, 3 and 4 sign segments (Paper 1) that are 
far more frequent than the large segments and hence larger texts can 
be seen as a combination of small segments of information. 


4) The Indus texts after segmentation can be viewed as permutations of 
the identifiable units (segments) of 2, 3 or 4 signs. The identifiable 
units may or may not be standalone (or complete) pieces of 
information. 


The nature of Indus writing that emerges from this and earlier 
work (Paper 1) is as follows. The written material is ordered in a 
statistically significant manner. The usage of signs is not uniform and 
nor is their pairing. There are clearly some important signs that appear 
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far more often than other signs. Similarly, there are sign combinations 
that also appear to be intentionally paired. These aspects were 
discussed in Paper 1. 


The study presented here indicates that these frequent 
sign-combinations have an additional property. These frequent 
sign-combinations appear to be placed within a larger text in specific 
sequencing. The standalone texts and most frequent signs and sign 
combinations are in fact parts of larger texts. This indicates that larger 
texts are a conglomeration of smaller texts or information units. 
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Introduction 


The relationship of language and nation has long been a curious 
package of contradiction. In parts, we have agreed that in many forms 
the 19th-century concept of nation-states has become obsolete. 
However, quintessentially, questions remain to be answered: 


1. Today how do we dare to define what nation is? 
2. What should it comprise? 
3. On which basis should a nation be formed? 


4. Whether or not subjective factors such as common history, 
consciousness and meta-cognitive awareness should be regarded as 
primary? 


Hardly any convincing answer may be granted. Above all, it 
should be taken for granted that the connection between language and 
nationalism is strong and cannot be ignored in any such discourse at any 
point of time. In the nineteenth century, search for collective national 
identity led to emphasis on a common language. It was a time when the 
national unit was commonly defined in terms of language, and in some 
cases it has prevailed in the twentieth century also. Here, it may sound 
interesting to consider the famous French slogan: 


"As you speak the French language by nature, it is 
reasonable that you should be the subject of the king of France. I 
quite agree that the Spanish language should belong to the 
Spaniard and German to the German. But the whole region of 
the French language must be mine." 
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This parametric feature for defining nationalism entails the 
insider’s way of looking at the process of formation where the principle 
of self-determination is largely based upon language. Carl Darling Buck 
argued, "A people without a language of its own is only half a nation" 
(Buck 1916:47-49). The factor of common language can never be taken 
as an only essential ingredient in the formation of pan-nationalism 
which the western world has tested without knowing much about the 
recipes which are otherwise more tasty and hygienic. If language had 
been always the only ingredient for self-defining and self-categorizing 
the nation, there should have been several thousand sovereign states 
_ with their own legitimate sharp boundaries. It goes without saying that 
language must be considered as a force in shaping nationalism but it 
can never be stretched to an extent that it remains only and the only. 
Nationalism as an abstract being gets translated into physical body 
through different modes of significance and value. Kohn rightly points 
out: : 


"Nationalities come into existence only when certain 
objective bends delimit a social group. A nationality generally 
has several potentially unifying elements; very few have all of 
them. The most usual of them are common descent, language, 
territory, political entity, customs and traditions, and religion 
(but) none of them is essential to the existence or definition of 
nationality" (1945:13). 


The nationalism of Pakistan, Israel, Ireland, Sri Lanka, Poland, 
Burma, Greece and Bangladesh, which are significantly based upon 
religion and not upon language, is a point to be noticed with utmost 
care. It is noticeable that the African nation identity is not based 
primarily upon the language and other features appear to be 
pre-eminent self-definer of national identity. Edwards rightly points out 
that (1985:37): 


"Nationalism with or without a linguistic component is 
based essentially upon same sense of grouping which informs 
ethnicity. Particular objective criteria are important but none is 
essential to the formation and continuation of nationalistic 
consciousness. This means that nationalism is largely a 
non-rational phenomenon; indeed, some have seen it as frankly 
irrational." 
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In the process of self-defining and categorizing, a parameter 
such as ‘the only’ has succeeded in making the issues of nationalism 
even more vulgar at the conceptual level. For this reason alone, it has 
created conflicts and tension in the history of mankind. Today, we have 
all reasons to come out from the western’s hopelessness. All those who 
are suffering from such syndrome must be reminded that nation is a 
space, either existed in memory or built up with efforts, for heralding 
continuous hopes by legitimising at-home-ness. Khubchandani (1983:6) 
has rightly pointed out: "Insular societies such as nationality group in 
Europe, in contrast, are marked by congruent identities terminating 
roughly at the same boundary, and thus are unable to clear cut 
categorization". | 


East-West Paradox 


In the western countries, minority ethnicity is something ‘private’ 
while majority ethnicity is a ‘public’ phenomenon. Looking at the 
texture of western pluralism, one is forced to conclude that it is a 
potential source of destabilization in the individual makeup. Misery and 
-unhappiness over hyphenation need to be examined in a refreshingly 
new way. Perhaps, even more naively, fundamental questions such as 
‘Was it always so?’ and ‘Why, when or how did it become so?’ need to 
be raised. At this juncture, it is interesting to see how Smolicz (1979: 
95-96) gave an account of this problem: 


"The terms American, Canadian and Australian have 
come to refer to the person’s current allegiance to a nation-state 
with its political institutions and economic structures, while 
labels such as Irish, Ukranian and Italian indicate ethnic 
ancestry and cultural traditions in other areas of life such as 
family, friendship, religion, language and manners. Viewed in 
this way, the two types of identities are not mutually exclusive; it 
is quite possible for a person to identify himself (SIC!) as both 
Irish and American, Ukranian: and Canadian, Italian and 
Australian." | 


Therefore any discussion concentrating identity and national 
fabrics of India must represent a break with traditional models. At the 
turn of the new century, it is probably ripe for attempting to bring these 
- issues into a thorough inquiry. A thorough introspection will certainly 
help us, in a refreshingly new way, in dealing with the insularity between 
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East and West and protecting the interest of each group and stopping 
(.... one) from running over (.... others). Since heterogeneous and 
poly-glossic societies such as those that go into making India a polity 
distinct from the typically singular western societies in many 
fundamental ways (Singh & Singh: 1999), these two kinds of societies 
deserve absolutely different kinds of theoretical modelling. This work 
does not take any either-or position in favour of plurality that exists in 
the West or in the East. Aversion for ours reveals one’s ignorance and 
arrogance. This world cannot be found fault with solely on the basis of 
what has been said or what is available in that world. If the ‘First World’ 
is a purist’s dream and viewed as integrative, then the ‘Third World’ is 
meta-cognitive and a culturalist’s ideology (Singh, 2001). One can 
conclude with firm conviction that a thesis that guided, consciously or 
unconsciously, so much past research on identity formation and 
national fabric, was to a large extent, a result of faux pas generalization 
from the limited western experiences. 


Begin with the West 


Bromley (1984:9) says self-identification is usually denoted by 
the term ‘ethnonym’ derived from the word ‘ethnos’. Ethnonymic 
self-identification is what he calls is endoethnonyms and it may differ 
from excoethnonyms, 1.e. ethnonyms given by other people. He gives 
examples of ethnos with the endoethnonym as "Deutsch" so named 
because of nemtsy in Russian, allemagne in French, tedesco in Italian. 
Bromley further: realizes that this condition of self-identification may 
prove insufficient for identifymg ethnos. He opens two other 
possibilities: 


1. Politonyms denoting "all people (citizens) living within the 
boundaries of one or another social organism.... or political 
formation" (page 10); 


2. Toponyms denoting "the group’s self-identification originating from 
the name of the territory it inhabits" (Jbid. 10). 


In that case, Belgians and French provide us with examples of 
one’s politonyms. His theory becomes more complex when he says: 
when a group of people move outside the original settlement, both 
- politonyms and toponyms are often not retained by the second and 
further generations. He gives an example: 
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"When the children of Russian settlers in Siberia resettle 
in the European part of the U.S.S.R., they will call themselves as - 
Russians, and not Siberians, i.e. they will miss their toponyms 
(Siberians) and will retain their ethnonym (Russian)" (page 10). 


In certain conditions, all three or two coincide. As he says: 


"When the politonyms of a group of settlers is retained 
from several generations, this means that the given 
self-identification is also the ethnonym. The coincidence of 
poltonyms and ethnonyms is especially characteristics of 
relatively mono-ethnical states: for instance, the words 
‘Frenchman’ (Francaise) or ‘Italian’ (Jtaliano) are both 
politonym and ethnonym" (/bid. 10). 


Skutnabb-Kangas uses the term endoethnonmy for the Finns in 
Finland using for themselves Finnish (Suomalaiset). “The Finns’ can 
thus denote both as the ethnonym, a politonym or a toponym._ 
According to her, ‘ethnonym’ indicates the ethnic group of 
Finnish-speaking Finns in Finland in opposition to ‘Finland-Swedes’, 
suomenruotsalaiset in Sweden and finlandssvenskar in Swedish. It 
declares all citizens of the state of Finland as ‘politonym’ regardless of 
their language or ethnic group whereas as ‘toponym’ includes all those 
who live in Finland. 


Now, it emerges from the above description that convergence of 
toponym, politonym and ethnonym is a major problem to find out the 
implication of self-identification in many cases (especially for the 
Finnish-speaking Finns in Sweden). It is a futile exercise to try to do it 
linguistically. Therefore, it is reasonable to leave the politonym part by 
asking people themselves about their group affiliation or citizenship. 
However, one can find another problem if someone changes nationality 
whereas many remain Finnish citizens. Then how can we differentiate 
between the politonym and ethnonym. What will happen in this 
situation is that if, say a Finnish-speaking Finn in Sweden describes 
himself as a "completely Swedish" person, as this could be interpreted 
an ethnonym, politonym and toponym. 


But problems do not end there: such a contrast does not help in 
clarifying the meaning for those who have not changed their nationality. 
The main difficulty then remains as to how to differentiate politonym 
from ethnonym, particularly for those who have not changed 
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nationality. As a sample, it is interesting to mention at Skutnabb-Kangas 
(1987) empirical investigation about self-identification of Finns in 
Sweden for whom complete switch over to Swedish would signal 
assimilation and would be mainly interpreted as an ethnonym whereas 
complete retention of only Finnish would signal segregation. In 
comparison, the other combinations such as Swedish and also Finnish, 
Finnish and also Swedish or both Finnish and Swedish signal tendencies 
towards integration. In this connection, the basic difference between 
Swedish-Finn and Indians living with labels such as Bengali, Bihari, 
Tamilian, Marathi, etc. is that signalling of these processes in the first 
one raises certain international issues whereas in the latter one, it is 
restricted to the state level and the issue is intra-national. Signalling of 
assimilation and segregation of Swedish Finns raises the question of 
whether they belong to this country or that country. The reverse is true 
in the Indian case. Although signalling of assimilation and segregation 
might uproot them from their host states but super-ordinate marker, 1.e. 
their being ‘Indian’, allows them to remain intact. 


Skutnabb-Kangas (1987) rightly points out that variable 
identification is the best solution. That would vary depending on to 
whom or where the identity is presented. That 1s, for a person A or in 
country 1, one could define himself/herself as X, or X than Y, or Y but 
also X. If this whole problem of identification is viewed from the 
country of origin, then only ethnonym remains but not toponym and 
politonym, but if it is viewed from the point of new country, then it may 
be toponym, politonym or ethnonym, or a combination of all three. 
Whatever it may be, self-identification in hyphenated form cannot be 
avoided here. For her, this hyphenation denotes some sort of a 
hierarchy as indicated in the following figure of Greek-American: 


Greek (ethnonym only if seen from Greece 
point of view) 


American (ethnonym, politonym and toponym, 
seen from U.S.A. point of view) 


The following figure will show this: 


Irish (ethnonym) 


American (politonym and toponym) 
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She recalls various lacunas in this pluralistic model. Any 
permanent solution for ‘hyphenation’ seems to be remote. It is almost 
impossible to conclude whether hyphenation will precede identity or 
whether it will follow. She strongly says that this framework operates 
also on monistic contexts and is not typical of only western societies. 
She suggests two changes are needed to prevail: firstly, major ethnicity 
has to become conscious and, secondly, a purely monistic view has to be 
altered. If these two changes are made, it is possible to conceive of a 
space where one could belong to two (or more) ethnoses without any of 
these ethnoses accusing them of lacking in loyalty because of their other 
allegiances. However, she accepts that this ideal condition seems to be 
generally prevailing in many Asian and African countries but it seems 
to be not there in the European or Europeanised contexts. 


Therefore, her suggestion to bring changes to prevail real 
pluralism in western countries seems to be a conditional solution. In 
fact, she fails to understand as to why they cannot be described im 
UN-hyphenated ways. In India, on the contrary, when a group of 
people move outside the original settlement within the country, all 
upper levels, viz. ethnonym, politonym and toponym, are preserved. 
This may be presented diagrammatically as follows: 


A. 
+[N] +[N] 
~ 
(PSV (EP. 1) 
oe 
+ [N] (E, P, T) 
+[S] +[S] 
(E, P,T) (P, T) 
+[N] = Nation 


+ [S] = State 
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Immigration (change of permanent residence to another 
country) and migration (temporary residence away from country and 
place) are the two phenomena that have created multilingualism in 
many western countries. As a result, many second-generation migrants 
experience marginality, a feeling of not really belonging either to the 
native country their parents or to the country of birth or residence. It 
remains crucial to see how, and to what extent, they eventually establish 
their own identity in generating hyphenated existence. A few 
multilingual countries can be cited as examples here. 


Nation Linguistic Groups 


German 
French 
Switzerland 
Italian 
Romansch 
| Swedish 
Finnish 
Flemish 
a French 
German 
English 
cite 
French 
In each of the resolutions that we have reviewed above, we have 
encountered identity - in terms of current political affiliation, return to 


root, ethic memories and loyalty - as a potential source of 
destabilization in the individual makeup. 
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The Other Side of the Globe: Remodelling Typological Schemata 


Differences can be exhibited systematically by formulating 
typological schemata of ecological variables. In so doing, we would 
produce, in effect, a framework of variables, which would serve to 
illuminate marking features of self-identification at different levels. One 
could imagine, as well, that such scaffolding could be used to inform 
and guide relevant theoretical assumptions for demarcation. The 
following assumptions could be proposed: 


1. A comprchensive, multidisciplinary analysis of ethnonym, toponym 
and politonym will be intrinsically useful. 


2. Emerging generalities may be found which will permit comparison 
and classification of different contexts under certain rubrics. 


It is now possible to move a bit further onward. In what follows, 
let us make the best use of variables as category labels, which are also 
available as a general framework of ecological factors and realized in a 
hierarchical order: 


A. Demographic variables Home, community, region, state 
and nation 
B. Linguistic variables Mother language, home language, 


community language, state 
language, national language, 
traditional language, institutional 
language, colonial language, caste 
language, class language, etc. 


C. Social political variables Taluk, district, state and nation 


D. Socio-cultural variables Ancestry, clan, caste, religion, 
: socio-cultural groups and nation 


E. Socio-Psychological Commitments for preserving own 
| traits, tolerance, strong integrative 
tendency, national feeling, etc. 


One can point out that there are many other perhaps more 
subtle, but still important, variables which operate in this context. 
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Returning then to our objective, it can be predicted that. without 
describing the linguistic ecology as a whole, it would be almost 
impossible to draw the boundary of differences. Typological schemata 
has a number of distinct analytical advantages from the Indian point of 
view: 


a) It provides a dynamic bridge between objective and subjective 
approaches to the definition of Indian identity; 


b) It accounts for similarity in the construction of collective identity. 


Such a model is helpful not only in exhibiting the dynamic role of 
these variables in identity formation but also helps in understanding the 
political, demographic, linguistic and socio-cultural fabrics of India as a 
whole. Admittedly, through this project objective as well as subjective 
meanings in identity creation, projection and identification can well be 
explored. As a general guide to the analysis of ecological variables 
relating to the construction of identity at the national level, the 
following agenda can be proposed which accounts for: 


1. Preconditions of national identity, viz. those factors which account 
for collective identity; 


2. Bases of internal distinctiveness and uniqueness; 


3. Conditions which block conflict, hyphenation and anomie of 
identity; 


4. Likelihood of success. 


Typological schemata of ecological variables are needed because 
there are wide varieties of factor and ways in which Indians identify 
variably and distinctively among themselves being sufficiently Indian 
and insider. Metamorphosing and reduplicating of identity features do 
not disbar either from the apexial marking or from any other available 
in the hierarchical ordering. At each level, a new sense of positivism 
prevails without denying one another. Similarly, marked features for 
one do not necessarily act the same for others either at the 
corresponding level or at the uneven level. 


Paradox of Construction 


Let us be sure that it is still in the beginning phase and should be 
seen as preliminary sketches. This, then, is the state of affairs at the 
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moment. Clearly, much more work needs to be done before a usctul 
typology results. I believe that a comprehensive typology would be a 
useful tool for description and comparison, leading to more complete 
conceptualisation of identity. These questions will necessary help in the 
process, building upon previous insights and will also involve a 
considerable search of relevant literature: 


1. How highly regarded are the levels such as Kashmiri, Tamilian, 
Gujarati, etc.? 


2. To what extent do the following groups (as above) correlate 
themselves with the super-ordinate marker - (Indian)? 


3. How highly regarded are the levels such as Bihar, Uttar Pradesh, 
Himachal Pradesh, Uttaranchal, Jharkhand, etc.? 


4. How highly regarded and differentiated are the levels such as 
Kanauji, Kumauni, Garhwali, Bhojpuri, Braj, Awadhi, Maithili, 
Magahi, Sadan, Kortha, Chattisgarhi, Bagheli, Rajasthani, Marwari, 
Mewari, Malvi, Mandeali, Simauri, Chameali, etc.? 


5. How highly are Sanskrit, Urdu and English regarded as markers of 
self-identification, self-categorization and _ self-projection at 
ethnonymical, toponymical and politonymical levels? 


6. How are Konkani, Santhali and Sindhi identified at intermediate or 
subordinate levels? 


7. How and in which conditions are linguistic pockets such as Saurastri 
in Tamil Nadu and Bengali in Banaras surviving? 


In an attempt to answer these questions, one has to have a 
clear-cut answer for the following: 


A. How do Indians evaluate the importance of language as compared 
to religion, race, territory as a marker of collective as well as 
individual identity? 


B. Under what conditions does India as a whole assume greater 
importance to ethno-national communities? 


C. What are the causative variables that combine the nation into 
ethnonymical, politonymical and toponymical units? 
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D. What are the instruments of nation-building? 


E. How and under what conditions is the maintenance of linguistic 
pluralism carried out at the trans-ethnic political unit? 


To substantiate a better analytical perspective, let us begin with 
the formulation of these three categories, which may well be labelled as 
indigenous: 


1. Apex: National, culminated, vertex, high point 
2. Indoor: State, equidistant 


3. Base: Pedestal, core, root 


These orders presumably explain why (Indian-ness) supersedes 
various indigenous idioms either at the subordinate levels or at the 
grass-root levels. Here, the point to be noted is that the common 
consciousness of being a nation was derived from trans-historicity 
stretching back to immemorial times, which has transcended in the 
fuzzy order of nation-building. The heart of the matter is this: for any 
Indian, the conditions are reasonably favourable to attach himself at the 
apex, indoor and base levels at a time without accusing each other. This 
can be shown diagrammatically as follows: 


Indian 
Bounded _ territory 


Apexial Extension Output 
Indoor 


Freedom of 
choice 


Socio-cultural 


Surfacial oe 


I = Integration 
ED = Expansion Demographic 
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The ramifications of ecological variables make the substance of 
identity at all levels an enigmatic subject. Most importantly, therefore, 
issues which are to be investigated: 


1. As to what are the factors and preconditions which are likely to be 
operation in ensuing a ‘collective identity in such a diversity; 


2. What motivates an Indian to identify himself / herself differently at 
different levels keeping apex identity intact. Variables are mutually 
enforcing and reinforcing. 


Apexial Level 


In the heart of each and every Indian, there is a real self in which 
resides the essence of authentic self as an Indian. As a marker, it is 
neither patchable nor exchangeable. Being a culmination of all other 
categories, a maker such as this happens to be the open and reserved 
choice within which every individual is forced to define himself / herself 
differently. At ethnonymical, politonymical and toponymical levels, in 
the hierarchy, it has demonstrative capability to cheek all other causal 
differences that might sound even very sharp at lower levels. A marker 
such as this not only erases hyphenation but also blocks with force. 
Either subjectively or objectively, the most virtual manifestation of the 
top-order marker is that as an amalgamation of all, it is blessed with a 
novel crown. What is uniquely preserved at the apex level is an 
expression and privilege of altering, categorizing, maintaining and 
adopting personal identity without destroying and disturbing national 
identity. It may be viewed as a tenderness and pleasure of 
trans-historicality. As a matter of fact, it does not matter how sharply 
local and regional identities vary with political, linguistic, socio-cultural 
boundaries. | 


At the apex level, the reverse is true in the multilingual countries 
of the west, as selfhood is toponymically and politonymically legitimised 
and suffers from some sense of ethnonymical hyphenation. For 
instance, at the apex level, French-speaking Canadians and 
English-speaking Canadians possess dual identity within the 
politonymical, toponymical and ethnonymical boundaries. In such a 
situation, the issue of hyphenation consistently remains suspicious. For 
French- and English-speaking Canadians, the justification for (trans- 
gene-rationality) does not hold true. It may also hold true that in many 
multilingual countries of the west, identity features at apex level may be 
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toponymically, politonymically and ethnonymically unified, but indoor 
forms are not historically derived, as people of different ethnicities who 
join from different countries have well-defined national identity 
attached to their host country. Australia, with several long-settled 
ethnic groups such as Jews and Chinese, is composed of a number of 
smaller aboriginal inhabitants and the majority - staasvolk - is English- 
speaking of Anglo-Celtic origin. 


A French in America, with French or without French, may 
remain ethnically French until and unless he / she surrenders himself / 
herself. His French-American symbol remains unchallenged as the first 
part is related with ancestry, legend and history and the second part 
symbolizes adopted toponymical and politonymical identities. In this 
case, it is hard to decide which is more dominating, open and powerful 
and which part is less dominating, open and powerful. Migrants from 
the western multilingual countries face the crisis of hyphenation. 


Indoor Level 


At the immediate subordinate level, ethnolinguistic units are 
always breakable into politonym (political group), toponym 
(geographic region) and ethnonym (ethnic group). Nation, as a whole, 
is an amalgam of these units and at this level, all traits get integrated 
into such a way that, even theoretically, no more upward fragmentation 
sounds logical. At the subordinate levels, as many boundaries are 
possible but, taken together, justify the off-quoted idiomatic expression 
of ‘unity in diversity. Various names like Bengal, Tamilian, Telugu, 
Marathi, etc. are various ethnolinguistic units within the nation. In 
comparison, terms like Bihar, Uttar Pradesh, Madhya Pradesh, etc. do 
not evoke the sense of ethnolinguistic units but signify only political and 
geographical significant boundaries. 


In the hierarchy, below’ super-ordinate ethnonymical, 
politonymical and toponymical layers, there lies a complex network of 
‘indoor identity which can, in the real sense, be called branching of 
apexial identity into subordinate identities. By its virtue, it may also be 
called ‘medial’. Politonymically and toponymically, medial category is a 
result of post-independent political restructuring. At this level, 
ethnonymical definition of self does not always stand in one-to-one 
correspondence with politonym and toponym. In many cases, indoor 
identity also bears ethnonymical significance along with toponymical 
and politonymical. 
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On the surface, many features belonging to this category may 
appear as cosmetic in reallocating intra-levels. Expansion and 
distribution of subordinate markers with internal boundaries for 
maintaining group differentiation is always possible. Hence, indoor 
identity sounds like neither aberration nor devaluation. In fact, in the 
topological schemata, such duality represents tolerance. In all possible 
conditions where demarcation is made _ linguistically, other 
ethnonymical ingredients hold the position for justifying the logistics of 
the makeup. Indoor features are very handy for intra-differentiation 
and may also be treated as a binding force for liberating the 
self-perception, categorization and identification. Therefore, its 
legitimacy in socio-political divisionally is sanctioned. This makes it 
simpler to conclude that even the objective of linguistic divisions was 
largely based upon the logistics of strengthening the federal system and 
smoothening administrative functioning. The issue of state formation on 
the linguistic basis was an administrative advocacy. Within this 
category, if language is taken as a base, then six types of zone are 
identifiable: 


(a) Linguistically - Stable Linguistic Group with well-defined 
Demographic and Political Boundaries 


Ethnonym Toponym and Politonym 
Bengali Bengal 
Tamil Tamil Nadu 
Marathi Maharashtra 
Gujarati Gujarat 


Oriya Orissa 


Punjabi Punjab 


Kannada Karnataka 
Telugu Andhra Pradesh 
Malayalam Kerala 

Kashmiri Kashmir 


Assamese Assam 
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(b) Comprised Linguistic Group with Different Political Boundaries 
Politonym & Toponym Ethnonym & Toponym 


Uttar Pradesh Kanauji, Kumauni, Garhwali, Bhojpuri, 
Braj, Awadhi, etc. 


Bihar Bhojpuri, Maithili, Magahi, etc. 
Madhya Pradesh Chattisgarhi, Baghelli, etc. 


Rajasthan Rajasthani, Marwari, Dhundari, Mewari, 
Malvi, etc. 


Himachal Pradesh All Pahari languages: Rahari, Mandeah, 
Sirmauri, Kului, Chameali, etc. 


(c) Linguistic Group with no definite Politonymical and Toponymical 
Boundaries 


Santhali, Sindhi 


(d) Language without Indoor Boundary 


Apexial Level Sanskrit Traditional / Religious / Ritual 
Urdu Religious 
English Colonial / Official 


(e) Greater Linguistic Region 
Use of Hindi in Punjab, Gujarat, Maharashtra 


Punjabi in Delhi, Himachal Pradesh, Jammu 
(f) Survived Linguistic Pocket 
Saurastri in Tamil Nadu 
Bengali in Banaras 
Justifiable Dilemma 
A State may exhibit both configurations: 
i | +ethnonym, + toponym, + politonym] 


2. [-ethnonym, + toponym, + politonym] 
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In the second case, the first rule is largely violated because the 
States in the Hindi regions had been organized only on toponymical 
and politonymical grounds, but this gives a clear-cut picture that 
because of this variation, the super-ordinate level is not affected. 
Creation of States on the consideration of majority ethnolinguistic 
group has injected misunderstanding on many counts. 


An evidence of tolerance could be seen through this example. 
Dogri and Kangari in 1961 Census were regarded as varieties of 
Punjabi, but since 1971, under the reclassification scheme, Dogri has 
been recorded an independent language status and Kangari 1s 
reclassified under the Hindi amalgam. See the profile of Hindi as a 
mother-tongue as a result of return from first mother-tongue. 


197] Variation 
Uttar Pradesh 
Bihar 


Madhya Pradesh 
Rajasthan 
Himachal Pradesh 


The highest variation with 76.8% is noticeable in Himachal 
Pradesh between eight Pahari languages and Hindi. The trend of 
decrease and increase can be interpreted from many angles. Apart 
from comprised linguistic groups, elasticity or fluidity in the assertion of 
Hindi as mother-tongue is noticeable in the greater Hindi region, as the 
trend of gain between 5.4 to 6.1 was reported in the same census report. 


Let us come to the conclusion that indoor compartmentalization 
on the linguistic ground is largely based on majority configuration of 
language use within politonymical and toponymical boundaries which 
has ethnonymical significance. The naming of politonym and toponym 
units based upon linguistic criteria falls on the principle of 
predominance of language within the unit. Here are examples: 


Percentage of Population 


Bengali Bengal 84.3 
Gujarati Gujarat 90.5 
— Tamil Tamil Nadu 83.2 


Oriya Orissa 82.3 
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Punjabi Punjab 55.6 
Manipur Manipur 64.5 
Marathi Maharashtra 76.5 


Assamese Assam 
Kannada Karnataka 
Kashmiri Kashmir 


Malayalam Kerala 


Telugu Andhra Pradesh 


Rajasthani Rajasthan 


Tripuri Tripura 


Nicobarese Nicobar Island 


However, for two States - Andhra Pradesh and Kerala - 
linguistic criterion for politonymical and toponymical divisions seems to 
be justifiable but the name of the States does not follow a linguistic 
criterion as the terms like Telugu and Malayalee do locate a person’s 
politonymical and toponymical boundaries along with an ethnonymical 
emblem. Rajasthani language rightly stands as a justifiable criterion for 
characterizing Rajasthan as a politonym and toponym unit but 
ethnonymically, it is a part of the large Hindi populus. Similar may be 
the case with Tripura. Tripuri language with about 25% of population 
may be one of the factors in the development of Tripura as a separate 
politonym and toponym unit but the majority language of the State is 
Bengali with almost 62.5% population. Ethnonymically, language-based 
category is not absolute and permits overlapping, and in hierarchy, it is 
a three-tier ethnonymical, politonymical and toponymical unit. 
Undoubtedly, indoor boundaries, in many cases, do provide a linguistic 
sense of self-identification within well-defined toponymical and 
politonymical boundaries and also stand as a tool for self-categorization. 
The comprised linguistic group is largely Hindi-speaking which is 
distributed into seven politonymical units with wide regional varieties. 
Indoor politonym with toponymical organization is often made on the 
basis of unconceivable demands. The creation of new boundaries such 
as Jharkhand, Uttaranchal and Chattisgarh is perhaps the best example 
of politonymical restructuring for administrative purposes. In the third 
category, there are some traditional linguistic groups which carry the 
same politonymical and toponymical identities with those of the 
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majority group. This could be an answer to a question why these 
languages are surviving over years. 


Linguistic Group State 


Bodo Assam 
Dogri Jammu & Kashmir 


Konkani Maharashtra 
Saurastri Tamil Nadu 
Santhali Jharkhand 

Kui Orissa 

Kurkh Madhya Pradesh 
Coorgi Karnataka 


Linguistic consciousness is high and that is why now and then 
vocal resentments for separate political units are heard. Urdu presents 
. the best example of a language with religious affinity without any 
toponymical definition. Politonymically, Urdu receives a respectable 
position in many States as second, third or minority language. Hindi 
and Urdu present a live picture of unmarked interchangeability and 
intelligibility, as distinction between Standard Hindi and Urdu is not 
even as compartmentalized as two distinctly prescribed literary 
standards of Yugoslav language, Serbo-Croatian (Serbian in Cyrillic 
script and Croatian in Latin script). Since Muslim population is 
scattered throughout the country, Urdu cannot have an indoor 
politonymical and toponymical definition except ethnonymical within 
the apexial politonymical and toponymical and _ ethnonymical 
boundaries. A large proportion of Muslims in many regions tend to 
have bilingual control over the various languages of the regions along 
with Urdu. It enjoys the status of minority language with an additional 
privilege of being a second and third language in many States and has 
the privilege to attach lingo-religious markers for self-identification at 
any level irrespective of the majority group’s indoor politonymical and 
toponymical definition. In many cases, topological and politonymical 
and linguistic identities precede the religious and cultural features, as 
the names such as Tamil-Muslim, Kerala Muslim, etc. are often 
recalled with respect. However, in the Hindi belt, linguistic identity, 
other than Urdu, does not precede in the same fashion but is again hard 
to differentiate on a linguistic basis. } 
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Surfacial Level 


Some of the basic traits such as caste, religion, occupation, 
region provide alternative ways for self-identification to differentiate 
with others in such a hierarchical complex network but no single trait 
can be treated as an absolute one. Such basic traits provide interactive 
experience and also help in restoring a part of what has been lost. More 
than that, these traits transform our present and future cross-cultural 
relations accordingly. 


Paradox of Construction and Countable Differences 


Nation is an entity which may have evolutionary, formational, 
behavioural and cognitive features. In the process of formation, many 
factors are equally functional. The basic problem of theory-building is 
to identify significant components which correspond and integrate with 
the national image. The significant components must act as distinct 
features and signifiers in distinguishing one from another. A nation 
must create its own image. Here, some of the significant components 
could be identified. 


Significant Component 


Language Nation 
Korean South and North Korea 


Italian 
French 
German 
Spanish 
Portuguese 
Nepali 
Russia 
Religion 


Islam 


Italy 
France 
Germany 
Spain 
Portugal 
Nepal 
Russian 
Nation 


Pakistan 
Bangladesh 


Geography: Compulsory to all. 


Political Unit: Must for all. 
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In the first category, slogans like ‘Onc language, one nation’ 
resonate well. The linguistic ecology of these countries is plain. Even 
the name corresponds with language. Acknowledgement of recognition 
derives from the articulation significant components. The western 
world has played the politics of recognition in a smooth manner but not 
with equal dignity. 


Conclusion 


The Third World has experienced the challenge of living with 
cultural differences. In different ways, cultural streams acknowledge the 
‘third human space’ and connect to enduring values of togetherness that 
transcend immediate personal and cultural location. However, it is 
interesting to note that nationalism is not a contested term as the 
western world has conceived and experienced. In particular, in India, 
nationalism is shared history of interactive experience. 
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= Sanghamitra Saha (Ed.), HB, Demy 1/4, Rs. 1,750/- (US$ 226/-) - 


= [Vol. I: 1998, pp. x+459; Vol. Il: 1999, pp. xi+433] im 
=! Volume I: This volume deals with the background of the ‘s 


=: study, land and population, history, religion, folk culture '5 
= and social life of the people of West Bengal. It covers both 's 
= western and eastern regions of Bengal. This thematically- = 


ms arranged information-packed volume is well-documented. - 


= Volume II: This volume has 10 chapters: Economy, ‘= 
=, Language and Literature, Educational System, Art, Media, ‘= 
=, Intellectual Achievements, Administration, Cultural Contact, is 
=, Important Personalities and Places of Interest. Of them, {5 
=' some are fascinating, like the last one that brings before the !5 
= readers a panoramic view of the different places and their ‘5 
=: natural environs. Similarly, the chapter on cultural contact = 
=, covers the relation between Bengal and her neighbours, as «= 


= well as the distant South, in a picturesque description. - 
+ Moderately priced, both volumes have many - 
=' pictures. A comprehensive index covering the 2 volumes is ‘3 
=' given for readyreference. _ i‘ 
=| A GRAMMAR OF TULU (A DRAVIDIAN LANGUAGE) ‘= 
zs S.L. Bhatt, 2005, Demy 1/8, pp. x+388, Rs. 200/- (US$ 40/-) i 
m= An exhaustive study of the Tulu language submitted ‘ 


=: by Dr. Sooda Lekshminarayana Bhatt as a Ph.D. thesis to «= 
=, Wisconsin University, U.S.A., this book has an introduction, «= 
=| the locality of Tulu, its relationship with the other i= 
=, languages, phonology, morphology and syntax. One is 
=, noteworthy feature of the book is the lexicon containing is 
=, arrangement and pronunciation, grammatical categories, is 
=. derivations, Tulu words and idioms, etc. This well-printed ‘= 


: publication will be an asset to language students and Tulu : 
=, scholars. | : 
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SOCIO-LINGUISTIC STUDY OF 
ORIYA PERSONAL PRONOUNS 
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Abstract 


The paper presents an overview of the socio-linguistic study of 
Oriya personal pronouns discussing some of their characteristics which 
are relevant for the speaker-hearer interaction in the speech-act situation. 


Introduction 


Personal pronouns in Oriya can broadly be classified into 
participant and non-participant of the speech-act situation. The first 
person as the addresser (or speaker) and the second as addressee (or 
hearer) are called participants or interlocutors of the speech event 
while the persons or things other than those of first and second persons 
are called non-participants of the speech events who come under the 
third person. 


Further, Oriya personal pronouns can be classified into 
addressing forms that include both addresser and addresses of first and 
second persons respectively and the reference forms which are 
included in the third person. The first and second persons under 
addressing forms are used in the shared communication while the third 
person under reference forms remains outside the shared 
communication. 


Among the three types of personal pronouns of Oriya, the first 
and third persons do not pose any problem but the second-person 
pronoun is very much complicated from the socio-linguistic point of 
view. , 
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The present paper tries to highlight some of the aspects of Oriya 
personal pronoun in the following sections. 


Oriya Personal Pronouns: A Brief Description 
(1) First Person 
(a) / mu /- singular - "I" 
In old Oriya literature, mui, muhi, muhi, etc. are used. 


The term /mu / is used for the addresser himself to show his egoistic 
attitudes of the "self". 


(b) / @me or ambhe / - plural - "we" 
In old Oriya, aimhe, ame, etc. are also used. 


Generally, / @me / -(excl. / incl.) is used in the sense of 
togetherness with or without addresses while / @mbhe / is used with the 
singular reference to the "self" by the people of the higher caste and 
status. This term is used in the aforesaid sense in the speeches of the 
characters like deities, kings, saints and others of ancient and modern 
Oriya literature who have held some high positions in the society. 
Similarly, the term is also used in the speeches of Brahmins (especially 
in the speeches of the typical Sasani Brahmins) of Orissa. 


In this context, the reflexive pronouns like / @pa, Gpana, nija / 
"self may be mentioned here which are used as sui referential terms in 
Oriya. 


(ii) Third person 


(a) / e, ie, ei / - singular - “this (fellow)" and / emd@ne ehimane / - plural - 
"these (fellows)" are used as proximate pronouns while 


(b) / se, sie, sei / - singular - "he / she", "that (fellow)" and / semdane, 
seimane ~ sehimane / - plural - "they" / "those (fellows)" are used as 
remote / non-proximate pronouns. 


Further, the forms / e, ie, ei, sie, sei / are used in the 
non-honorific sense and the forms such as / ema@ne sehimane / are used 
in the honorific sense while / se / - “he / she" 1s used in both honorific 
and non-honorific sense. 
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(iti) Second person 


Basically, there are two personal pronouns, viz. / tu, tame / for 
the second-person singular in standard (colloquial) Oriya. The other 
varieties of the form / tame /, viz. / tume ~ tumbhe / (sgl. / pl.) are 
chiefly used in written Oriya. Besides this, the forms / tui tuhi / 
and / teme / are used in colloquial (non-standard) Oriya and the 
form / apana / is also used in the speeches of the Oriya people as 
second-person pronoun (sgl. / pl.). 


The details of these terms are given below. 


(a) / tu /(sgl.) "you" 


Frequent use of this term can be seen in the family and outside 
the family. 


The elders of the family use this term for the younger ones and 
both father and mother or husband and wife in the family. If they are 
uneducated, they use this form for each other. Love and affection 
between juniors and seniors of the family and intimacy as well as 
friendship between friends and between the people of the same age 
outside the family can be maintained by using the term / tu /. Moreover, 
the term is used in the prayers and devotional poems. 


On the other hand, the derogatory sense of the term / tu / can be 
realised in the case of the addressees who belong to the lowest ranks of 
the society and do not have power and high status in the society. The 
servants and menials are very often addressed by this term. Similarly, 
people when quarrelling among themselves or when blaming one 
another also use this term which shows a certain amount of emotion, 
anger, mutual jealousy and hatred-ness of the people engaged in the 
quarrel. 


Further, the use of / tu / indicates the maintenance of the social 
distance between the people of high and low ranks, say, between the 
master and the servant, the landlord and the tenant, etc. However, no 
such distance is maintained among the members of the family while 
using this term. Thus, when intimacy of the addresser with the 
addressee increases, the distance between them loses its significance 
and hence the social solidarity can be maintained. | 
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The use of / tu / in the conversation can either be reciprocal or 
non-reciprocal. 


The reciprocal use can be felt between friends and between 
uneducated / semi-educated husband and wife and between father and 
mother while the non-reciprocal use of / tu / can be felt in the 
conversation between parents and children, between the teacher and 
the taught, and between the master and the servant. 


(b) /tame ~ tume ~ tumbhe / (sgl./pl.) "you" (Honorific denoting 
slight respect) 


The elders within a family may use / tu / for the younger ones 
who in return use / tame or tume / or / teme / for the elders of the family. 
Sometimes, the frequent use of / tame or tume / is found in the speeches 
of the educated father and mother and the educated husband and wife. 


The teacher while teaching in the class and the master while 
talking to his servant may use the terms / tame or tume /. Moreover, the 
honorific / tumbhe / is mostly found in the speeches of the conservative 
Brahmins (especially in the speeches of the Sasani Brahmins who are 
well-known for their Vedic rites and rituals) of Orissa. The term / tumbhe / 
is also found in the speeches of the characters of Oriya novels, dramas 
and short stories of modern Oriya literature but frequent use of / tumbhe / 
can be realised in ancient Oriya literature. 


The non-honorific or non-polite expressions like / tui or tuhi / 
(sgl.) - "you" can be used on a few occasions for the addressee who has 
an inferior status in the society. 


(c) /a@pana /(sgl./ pl.) | extra-honorific /extra-polite — "you" 

In the highly educated / sophisticated family, the extra-honorific 
term / @pana / is the norm which is used to show respect to elders and 
superiors of the family or outside the family. The term is also used for 
strangers or unknown persons, high officials, doctors, engineers, 
professors, scientists and other technical and non-technical persons to 
show them proper honour and respect. 


It is to be mentioned here that the term / G@pana / is not a part of 
the Oriya personal pronominal inflectional system but for agreement 
between subject and verb, it takes the third-person (sgl. / pl.) verbal 
forms in the honorific sense like the personal pronouns while it is used 
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in the syntactic construction. The following examples in this connection 
may be referred to: 


/ se khaila / "he / she ate" non-honorific 
/ se khaile / "he / she ate” honorific 

/ semane khaile / "they ate" honorific 
/Gpana khdile / "you ate" honorific 

/ @pana mane khiaile / "you ate" honorific 

But not 

*/a@pana (mane) khaila/ —"you ate" 


Here, the person markers such as -@ and -e are used in the 
non-honorific and honorific sense respectively. 


Other Aspects of Oriya Personal Pronoun 


(t) The Social Contexts in which the Personal Pronouns occur 


| Personal Pronoun __| | Meaning _| Social Context 


First | mu (sgl.) {+ honorific] | General use 
ame {+ honorific] | Group identity 
(excl. / incl. pl.) 
ambhe (pl.) {+ honorific] | Exalted position 
(used in the singular 
sense) 
Second] tu (sgl.) [- honorific] Familiar use in the 


family, in the prayers 
and devotional songs, 
used for intimacy, 
love, affection and 
friendship, used for 
non-polite expressions 


teme ~ tame [+ honorific] | General use 

(sgl. / pl.) 

tume ~ tumbhe [+ honorific] | Used in written Oriya, 
(sgl. / pl.) used in the speeches 


of the orthodox 
Brahmins of Orissa 
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tui or tuhi (sgl.) [- honorific] Used in colloquial 
Oriya in the rural 
setting 
apana (sgl. / pl.) [+ extra- Used for politeness 
honorific] and honour, used in 
the sarcastic sense 
Third | e, ei, ie (sgl.) [- honorific] Used in the 


"this", wate 


amane (pl.) "these" 


sie, sei (sgl.) 
"he / she" 


se (sgl.) "he / she" 


[+ proximate] 


[+ honorific] 
[+ proximate] 


[- honorific] 
[+ non- 
proximate] 


[+ honorific] 
[+ non- 
proximate] 


derogatory sense | 


Used 1n the 
general sense 


Used in the 
derogatory sense 


Used in the 
general sense 


se mane (pl.) "they"} [+ honorific] Used in the 
[+ non- general sense 
proximate] 


Note: The binary division of proximate / non-proximate shows the 


demonstrative bases of the Oriya third-person pronouns. 


(ii) Caste and Status Difference 


The asymmetrical use of the personal pronouns shows the caste 
and status differences that have been existing in the Oriya speech 
community. 


Oriya speech community with its hierarchical order and 
structure has been classified into Savarna (Non-Scheduled Castes) and 
Asavarna (Schedules Castes). The Savarna type is further classified into 
Brahmins (the worshippers of deities), the Khandayat-Kshetriyas (the 
warrior class or caste), the Vaisyas (the tradesmen) and the Sudras (the 
serfs). 


The Brahmins, the Khandayat-Kshetriyas and the Vaisyas are 
placed in the higher ranks while the Sudras are placed in the lower 
ranks of the Oriya society. Besides this, the Scheduled Castes which 
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remain outside the sacred pale of the Oriya society, are placed in the 
lowest ranks of the caste hierarchy. 


The maintenance of the respective caste and status differences 
which is closely related with the terms of address and personal 
pronominal usage is reflected in the corpus of such traditional social 
structures of the Oriya speech community in spite of the growth of the 
inter-caste-cum-interpersonal relationships that have taken place in the 
Oriya society after Independence. 


Thus, the people of the higher castes and status use the 
non-honorific term / tu / and sometimes the honorific / tame / to address 
the people of the lower castes and status but in return they receive the 
most or the extra-honorific term / G@pana / and in a few cases, / teme 
apana / from them. This itself shows that the previous caste and status 
differences still persist at certain levels of Oriya society even after 
Independence. 


(iit) The Choice of Honorific / Non-Hononrific Term 


The choice of honorific / non-honorific pronominal terms is 
determined by the social relationships of the participants of the speech 
event. 


The usage of the personal pronoun and the address terms are 
correlated with the social status and caste attitudes of the people. This 
is especially found in the case of the address terms like surnames, first 
names and the diminutive first names of the addressees for which the 
personal pronouns are used. The following examples in this context may 
be noticed here. 


Address Status Addressee Personal Pronouns 
Terms Used 


(a) Surname Nayak baboo, tame, apana / 
Mishra babu etc. 


Nayak, / tu, tame / 
Mishra etc. 


(b) First Name | High Hari, Nari etc. /tume, tame / 


(c) Diminutive | Low Haria, /tu / 
First Name Naria etc. 
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These examples show how the choice of the honorific / 
non-honorific status of the Oriya people is determined on the prestige 
scale according to which the personal pronouns are used. 


(tv) Personal pronouns in the Imperative Construction 


It may be possible to introduce @ as non-honorific marker, -a 
and -antu as honorific and extra-honorific markers respectively which 
can be added to the verbal roots so as to make them imperative 
construction in Oriya in the second person. 


The use of second-person pronouns such as / tu, tame, tume, 
tumbhe / and / @ pana / shows the variation in degrees ranging from 
non-honorific (or non-polite) to extra-honorific (or extra-polite) which 
can be identified in the following imperative constructions. 


/tu dekh - @ / "you see" (Non-Honorific) 
/ tame dekh -a/ "you see" (Honorific) 
/ apana dekh - antu / "you see" (Extra-Honorific) 


Here the verbal forms / dekh -@, dekh -a, dekh - antu / play an 
important role in revealing the nature of second-person pronouns in 
Oriya. 


Further, the second-person pronoun may either be progressive 
or regressive. The former is marked in the case of the close intimacy 
between addresser and addressees while the aS | is marked when the 
distance is maintained between them. 


It may be further stated that a shift of the second-person 
pronoun from progressive to regressive and from regressive to 
progressive can be possible on the formal / non-formal ground 
according to the communicative needs of the people in the different 
speech-act situations. 


Thus: 


(i) the formal > non-formal = progressive 
[tame or tume or tur lie apana] > [tu]/ V + -a/-anu>V+-@ 
Examples 


/ tame dekh-a / 
|} / tu dekh @ /- "you see" 
/ apana dekhantu / 
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(11) Non-formal > formal = regressive 
[tu] > [tame or tume or tumbhe, Gpana]/V + @ > V + -a/-antu 


Examples 


/ tu dekh@ | >— / tame dekha / 
[ ] "you see" 


/apana dekhantu / 


The honorific / non-honorific (or polite / non-polite) expressions 
in the third person can be found in Oriya with honorific marker -antu 
and non-honorific marker -u in the following imperative constructions. 


/ se dekhantu / "let him / her see" Honorific 
/ semane dekhantu / "let them see" Honorific 
/ se dekhu / "let him / her see" Non-Honorific 


Besides this, the imperative constructions like: 


/ khaiba heu / "please take food" lit. "let eating be done" 
eating became 

/ khaiba heuntu / "please take food" lit. "let eating be done" 
eating became 


are conservative types of expressions of request forms which were once 
used to show high respect and honour to the VIP-s of ancient and 
medieval Orissa. This type of request was also used in the Oriya society 
to show honour to the superiors in the pre-Independent period but now 
it is disused in Oriya. 


(v) Power and Solidarity 


The asymmetrical tendency created due to the social distance 
between addresser and addressee can be very much felt at certain levels 
of Oriya speech community which is reflected in the usage of / Gpana / 
and / tu / but the usage of / tu / also brings forth a close intimacy among 
the people of Orissa. 


The form / @pana / indicates the power of the addressee, his high 
status in the society, his formal relationship with the addresser, while 
_ the form / tu / although indicating the addressee with lower status still 
will be more helpful to promote friendship and goodwill among the 
people for maintaining solidarity in the society. 
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Power and solidarity are two different aspects of the society. 
Power can be shown on the basis of the strength, knowledge, wealth, 
politics, profession, age, sex etc. which create asymmetry in the society. 


On the other hand, solidarity in the society can be achieved 
through the reciprocal use of the non-honorific form / tu /. 


However, the transition of power towards solidarity is a recent 
development which is remarkably noticed at certain levels of the 
modern Oriya society. 


Similarly, the frequent use of / apana / by the addresser and the 
addressee is another remarkable tendency of the modern Oriya society 
which shows the artificial relationship among the educated people of 
Orissa. 


To sum up, the following points may be noted. 
(i) The complex nature of the second-person pronouns 


(a) The non-honorific pronoun (fz) is used in informal talk for intimacy 
and non-respect while honorific (tame) and extra-honorific (@pana) 
are used in formal talk for juniors and seniors respectively. 


(b) Generally the mother is addressed by the pronoun (fz) for intimacy 
and the father is addressed by the pronoun (tame) for respect but 
the extra-honorific term (@pana) is also used for the father in the 
highly sophisticated family. 


(c) The choice for the use of the personal pronouns (tu, tame, apana) 
depends on the factors like caste, social status, education, age, sex, 
wealth, family background, etc. of the addressee. 


(d) The asymmetrical use of the Oriya personal pronouns shows the 
caste and status differences of the addressee. 


(e) The shift from progressive to regressive and vice versa according to 
the communicative needs of the addresser reveals his dynamic 
behaviour of both addresser and addressee in the speech-act 
situations. 


(f) The use of the pronoun (tu) for the addressee shows that he has 
low/no status and hence has no power in the society. While persons 
with power and status are attached to the extra-honorific pronoun 
(apana), the pronoun (tame) seems to be neutral in respect of the 
power and status. 
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(g) 


Lastly, the complex nature of the second-person pronoun with its 
three levels of use such as tu, tame, Gpana corresponding to the 
three possible verbal forms indicates the different degrees of 
politeness in the speech-act situation, and the gradation in the 
degrees of politeness is found only in the second-person and not in 
the first and third persons. 


(ui) The first person (pl.) pronoun (@me or Gmbhe) behaves like a 


single participant (i.e. the plural form acts as a singular form) and 
no other examples are available in Oriya where the singular 
personal pronoun acts as plural but we have (tame ~ tume and 
a@pana) pronouns which are used as both singular and plural. 


(iii) The third person pronoun with demonstrative base (proximate vs. 


remote) can be used in the honorific and non-honorific sense. 


Conclusion 


(i) There has been a progressive tendency to reject / disuse the old 


(ii) 


Oriya forms like mui, muhi, aimhe, ammhe etc. The pronominal 


. forms such as teme, teme Gpana etc. and some imperative / request 


forms such as khaiba heu / khaiba heuntu etc. of conservative 
nature are also discouraged to be used in modern Oriya. 


Similarly, the frequent use of @mbhe, tumbhe in old Oriya literature 
has been reduced to the maximum extent in present Oriya. 


The gradual change in the past attitude of the people with regard 
to the caste (Savama vs. Asavarna) while using personal pronouns 
in Oriya 

The low-caste people used the extra-honorific forms like apana / 
teme- apana for the people of the higher caste and status. This was 
the social norm for the low-caste people to use such terms to show 
respect and honour to the people of the higher caste and status in 
pre-Independent Onissa but things have been changed in respect of 
the use of these pronominal terms in post-Independent Orissa. 


The fact is that now the people of the higher caste and status do 
not hesitate to use the extra-honorific term / @pana / for the people 
of low-caste who have acted as top officials in the different offices. 
This tendency shows that the previous caste restriction which stood 
against maintaining the solidarity in the society have been 
abolished while using the extra-honorific term / @pana / by both the 
sections of the people. 
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(1) The gradual extension of the extra-honorific term / Gpana / is also 
very well marked in the formal talk of the educated and 
semi-educated people of Orissa while the uneducated Oriyas of 
rural base favour the non-honorific form / tu / but they also use the 
extra-honorific term / @pana / as and when required. 


This tendency shows that the educated people of Orissa want to 
maintain formality in their behaviour through the use of extra-honorific 
/ apana / and honorific / tame / in their speeches which is a sign of 
sophistication and aristocracy. 
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NOMINALIZATION IN MANIPURI 


CH. YASHAWANTA SINGH & H. SURMANGOL SHARMA 
Manipur University 


1.0. Nominalization is noticed in Manipuri as a process of forming a 
noun from a verb as well as the derivation of NP from an underlying 
clause. The Manipuri nominalizing morpheme is pa’ which has a 
number of cognates in other TB languages. Manipuri -pa reveals 
functional similarities with that of Lahu ve at least as a nominalizer and 
as a subordinator. Since the verbs in Manipuri are bound in nature, 
their citation forms in isolation are always with -pa. In many TB 
languages, a single morphological non-variant form is capable of 
carrying out the processes such as nominalization, relativization, 
genitivization, attribution, etc. Such a no simple relationship between 
form and function raises several issues during the grammatical 
characterizations of these languages. Forms in -wa in Chantyal (a 
Bodish language) performs multifarious functions (see Noonan, 1997 
for detail). The present paper specifically attempts to explore the 
various functions of the default nominalizer -pa, which is playing a 
significant role in the Manipuri morphology and syntax. The forms that 
are used as nominalizers other than -pa are pot ‘thing’, jat ‘kind/type’, 
etc. out of which the latter is a borrowed word from the Indo-Aryan 
(see Chelliah, 1997). In Manipuri, forms in -pa denote the functions as 
in: 


A. Lexical Nominalization 
B. Verb Complements 

C. Sentential Complements 
D. Subordinate Clause 


E. Conditional Clause 


1. -pd becomes -bd after a voiced scund. 
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F. Main Clause Nominalization © 

G. Relative Clause 

H. Ambiguous Construction: Nominalization vs. Relativization 
I. Finite verb Nominalization 


J. Attributive Adjectives 


2.0. Lexical Nominalization 


In this section, word level and phrasal level nominalizations are 
treated as the same because the latter after nominalization gives rise to 
a single unit though often orthographically represented as two words. 
Lexical nominalization is generally noticed when naming a person, 
occupational titfe or the person holding the occupation, etc. First of all, 
let us consider the Manipuri (Meitei) personal names, which are 
formed with the addition of -pa to a verb that denotes physical 
appearance, quality, etc. 


2.1. Indigenous Personal Names 


1) a. tomba 
ton-pa 
be youngest-NZR 


‘a male name (usually given to the youngest child)’ 


b. thoiba 
thoi-pa 
win-NZR 


‘a male name’ (the literal meaning is ‘the best of all’) 


c. cawba 
caw-pa 
be big-NZR 


‘a male name’ (the literal meaning is ‘one who is big’) 
2.1.2 Names of Occupational Title 


| -pa 1s also used when naming an occupation or the person 
related to that. 
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2) a. caphu sa-ba 
Ca-phu sa-pa 
rice-container make-NZR 


‘potter/the profession of pot-making’ 


b lat yon-ba 
lat yon-pa 
flower sell-NZR 


“florist/the profession of selling flowers’ 


Cc. séngom  yon-ba 
san-khom yon-pa 
cow-milk sell-NZR 
‘milkman/milkmaid/the profession of selling milk’ 


d. lai yek-pa 

lai yek-pa 

picture draw-NZR 

‘painter/painting (occupation)’ 
e. pham-naiba 

phom-nai-pa 

place-be-NZR 

‘a high-rank official’ 


2.1.3 Names of Traditional posts 


The following are the names of the Manipuri traditional posts; 
however, as of now, they are not in use. 


3) a. kayninba 
kody-lin-pa 
granary-establish-NZR 


‘treasurership / treasurer’ 


b. sdnlinba 
son-lin-pa 
house-establish-NZR 


‘a person in charge of executing criminals condemned 
to death / the post of this kind’ 
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2.1.4 Socio-cultural Terms 


It is noted there are socio-cultural terms developed through 
nominalization. Consider the following wedding related terms. 


4) a. nupi sennaba 
nupl sennda-pa 
woman take care-NZR 


‘a person who attends the bride at a wedding’ 


b. bor sennaba 
bor senna-pa 
bridegroom take care-NZR 


‘a person who attends the bridegroom at a wedding’ 


In the above examples, it is clear that the first constituents are, in 
general, nouns and the second constituents are deverbal nouns. 


2.2.0 Female Nominalizer -pi 


The origin of female nominalizer -pi has been traced to the stem 
pi ‘grandmother’; however, its use is not always restricted to indicating 
biological gender (see Chelliah, 2004 for detail). Consider the following 
examples where -pi signifies no gender at all. 


2.2.1 Naming of Traditional Games 


Names of Manipuri traditional games but not all are formed by 
nominalizing the verb with -pi as given below. 


5) phibul habi 
phi-pul _—hai-pi 
cloth-tie swing-NZRF 
‘name of a traditional game played by children’ 


6) caphu thugaibi 
ca-phu thugai-pi 
rice-container break-NZRF 


‘a blindfolded competition for hitting a mud pot’ 
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7) yubi lakpi 
yupt lak-pi 
coconut snatch-NZRF 


‘a rugby-like game played with a coconut’ 


8) laphu kabi 
laphu ka-pi 
bananatree climb-NZRF 


‘a competition for climbing a banana tree’ 


2.2.2 Naming of inanimate and animate 


The proper naming with -pi is also seen in other areas such as 
naming of things, animals, plants, persons. 


2.2.2.1 Objects with -pi 


9) tengot khonnaibi 
tengot khon-nai-pi 
metal bowl leg-be-NZRF 


‘a kind of metal bowl having a round support below 


10) yotsubi 
yot-su-pl 
iron-pound-NZRF 


‘iron trivet’ 


2.2.2.2 Plants with -pi 


11) teynaumanbi 
tennau-man-pi 
a kind of reed-resemble-NZRF 


‘a kind of creeper whose fruit is eaten as a vegetable’ 


12) hawai asanbi 
hawai d-San-pl 
bean Att-be long-NZRF 


‘a kind of long bean eaten as a vegetable’ 
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13) kobi mapansatpi 
kobi md-pansat-pl 
cabbage NM-bear bloom-NZRF 


‘cauliflower’ 


14) hawaimubi 
hawali-mu-pi 
bean/pulse-be black-NZRF 


‘a kind of broad bean’ 


2.2.2.3 Animals with -pi 
15) tharaikokpi 

th arai-kok-pi 

id.-lay egg-NZRF 


‘name of an aquatic insect’ 


16) thoaroi ninkhdibi 
tharoi ninkhai-pi 
shell crumble-NZRF 


‘name of a fresh water shell’ 


17) narakpi 
na-lak-pi 
fish-snatch-NZRF 
‘kingfisher’ 


18) anancabi 


anan-ca-pli 
child-eat-NZRF 


‘name of a green fleshy worm’ 


2.2.2.4 Specific Naming of an animal 


~C.Y. Singh & H.S. Sharma 


Other than the ones given above, it is also noted that the name of 
an animal usually a domestic one is sometimes given a name in 
accordance with its colour and the pattern of colours it has as in the 


following examples. 
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19) utmanbi 
ul-man-pl 
ash-resemble-NZRF 


‘name of a cat, cow, etc. having a greyish coat’ 


20) ananbi 
d-nan-pl 
Att-be red-NZRF 


‘name of a cow, goat, etc. having a reddish coat’ 


21) amubi 
d-mu-pl 
Att-be black-NZRF 


‘name of a cow, dog, cat, etc. having a black coat’ 


3.0 Verb Complements 


Forms in -pé are used as verb complements in Manipuri as in 
Chantyal with -wa (See Noonan, 1997). 


22) tomba na caba pammi 
tombe =—s__na ca-pa pam-i 
Tomba fish eat-NZR like-RL 


‘Tomba likes to eat fish’ (lit. Tomba likes eating fish) 


23) tomba kaithel catpa pamde 
tombs katthel Cat-pa pam-ta-i 
Tomba market go-NZR like-NEG-RL 


4.0 Sentential Complements 


A nominalized clause when embedded into another clause 
functions as a complement as in (24) and (25) below and are 
diagrammed with braces. 


24) ay {tombana ramda sel piba} khanni 
oy iomba-na ram-ta sel pi-pa khon-i 
I Tomba-NOM Ram-DAT money give-NZR_ know-RL 


’ ‘I know that Tomba gives money to Ram.’ 
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25) {nasi makhoi  lakpa} oythokte 
nasi ma-khoi  lak-pd oythok-té-i 
Today 3p-H.PL come-NZR_possible-NEG-RL 
‘It is not possible that they come (here) today.’ 


5.0 Subordinate Clauses 


Forms in -p@ along with different case markers enable us to 
construct various subordinate clauses. Concerning with the structure of 
a Manipuri subordinate clause reveals the fact that the whole clause 
first gets nominalized and as a continuing process respective case 
markers come to attach after the nominalizer. Different types of 
subordinate clauses are presented as follows: 


5.1 -po-Locative 


26) tomba  lakpada ramu__—_—ilaki 
tomba lak-po-ta ramu.—_lak-i 
Tomba come-NZR-LOC Ramu come-RL 
‘Ramu comes when(ever) Tomba comes.’ 


5.2 -po-Nominative 


27) non  cudabana lawmisin ware 
non  cu-td-pa-na law-mi-sin wa-le 
rain fall- NEG-NZR-NOM paddy field-person-PL be sad-PERF 
‘The farmers are sad as there has been no rain.’ 


5.3 -pa-Ablative 


28) laink konna pabodagi ma phars klas phanle 
lairtk kon-na pa-ps-tagi ma phars klas  phan-le 
book hard-ADV read-NZR-ABL he first class obtain-PERF 
‘He has passed in the first class due to his devotion to the studies.’ 


5.4 -pa-Associative 


29) tomba lakpaga ramu__—_—lakkani 
tomba _—_lak-pa-ka ramu__—_—lak-kani 
Tomba come-NZR-ASSO Ramu come-UNRL 
‘As soon as Tomba arrives (here), Ramu will come.’ 
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5.5 -pa-Genitive 


30) tomba ganja thokpagi pulisna phare 
tomba ganja thok-pa-ki pulis-na pha-re 
Tomba ganja smoke-NZR-GEN police-NOM arrest-PERF 


‘The policeman has arrested Tomba for smoking ganja.’ 


6.0 Conditional clause 


It is also interesting to note that a conditional expression in 
Manipuri is made through nominalization plus definitive suffix as in (31 
& 32) below in which the verb obligatorily carries the prospective suffix. 


31) tomba —_laklabadi ramu_—_lakkani 
tomba — lak-la-ps-ti ramu__—_lak-kani 
Tomba come-PROS-NZR-DLMT Ramu come-UNRL 


‘Ramu will come if Tomba comes.’ 


32) lairik tawnindrabadi skul catkanu 
laink  tau-nin-td-ra-po-ti Skul cat-ka-nu 
Book do-WISH-NEG-PROS-NZR-DLMT school go-POT-PROH 


‘Don’t go to school if (you) are interested in studies.’ 


7.0 Main Clause Nominalization 


In certain cases, Manipuri allows matrix clause nominalization 
whereby the relation between the subordinate and matrix is held by 
attaching -ni ‘copula’ to the matrix clause as in (33) below. 


33) a.tomba ganja _ thakpagi pulisna phabani 
tomba ganja _ thakpogi pulis-na pha-pa-ni 
Tomba ganja smoke-NZR-GEN police-NOM arrest-NZR-COP 
‘It is a fact that the police have arrested Tomba because he 


smoked ganja.’ 


b.tomba lakpadagi ramu___lotpani 
tomba lak-pa-tagi ramu__lot-pa-ni 
Tomba come-NZR-ABL Ramu _ hide-NZR-COP 


‘Since Tomba came (here) Ramu went into hiding.’ 
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8.0 Relative Clause 


Givon (1990: 646p) states the semantic definition of relative 
clause as "a relative clause codes a proposition one of whose 
participants is coreferential with the head noun that is modified by the 
clause". In Manipuri, too, one of the participants in the relative clause ts 
coreferential with the head noun and we are discussing here how the 
condition of coreferentiality between a participant in the relative clause 
and the head noun is syntactically maintained. Manipuri does not 
employ any separate grammatical or lexical relativizer. Matisoff (1973: 
363p) provides evidences from the selected Asian languages and even 
the English word ‘that’ using both as a nominalizer and as a relativizer, 
which shows that there is a deep connection between nominalization 
and subordination. And interestingly, Manipuri could also be 
considered as one of these languages where relativization is noticed as 
one specialized function of nominalization. In the following examples a 
relative clause is marked with square brackets. 

34) foygi  iconnupi luhonba] nupadu daktarni 
oy-kt = Con-nupl luhon-pa nupa-tu dakKtar-ni 
I-GEN my sister-woman marry-NZR man-DET doctor-COP 
‘The man who married my sister is a doctor’. 


The above (34) is derived from the following clauses: 


a.Main clause: nupadu daktarni 
nupa-tu daktar-ni 
man-DET doctor-COP 


‘The man is a doctor.’ 


b. Subordinate clause: 
nupaduna oygi iconnupt luhonni 
nupa-tu-na si-ki — icén-nupi luhon-i 
man-DET-NOM I-GEN my sister-woman marry-RL 


‘The man married my sister.’ 


It has been clear from the above illustration that the relativized 
head noun has a coreferent in the relative clause. Concerning with the 
relative position of a relative clause in Manipuri, it is prenominal. It will 
: be too early to say that there is no postnominal relative clause at all but 
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the general tendency reveals that the head noun preceded relative 
clause loses the legitimacy of being a relative clause, which is evidenced 
from the following ungrammatical construction. 


35) *nupadu ss [digi ~—s icannupi luhonba] daktarni 
nupa-tu _al-ki icon-nupi | luhon-pé daktar-ni 
man-DET I-GEN mysister-woman marry-NZR_ doctor-COP 


We can reconfigure (35) by permuting the head noun along with 
its predicate before the whole relative clause, then, we get a 
grammatical sentence as in (36) below. 


36) nupadu daktarni [digi icannupi luhonba] 
nupa-tu daktar-ni [ol-ki — icon-nupi luhon-pa] 
man-DET doctor-COP I-GEN mysister-woman marry-NZR 


‘The man who married my sister is a doctor.’ 


As in other languages, any core argument can function as a 
relative head noun. Let us consider the following examples. 


37) ramuda laink piba nupadu sire 
ramu-ta lairik pi-pa nupa-tu si-le 
Ramu-DAT book give-NZR man-DET die-PERF 
‘The man who gave a book to Ramu has died.’ 


38) ramuna lairik piba nupadu _ Sire 
ramu-na laink pi-pa nupa-tu si-le 
Ramu-NOM book give-NZR man-DET die-PERF 


‘The man whom Ramu gave a book has died.’ 


39) ramuna piba lairiktu segayre 
ramu-na pi-pa laink-tu segay-le 
Ramu-NOM give-NZR book-DET tear-PERF 


‘The book Ramu gave has been torn.’ 


In English and many other languages, a particular relativizer is 
chosen in terms of the syntactic function of the relative head noun, for 
instance, the distinction between who versus whom, who versus what, : 
etc. In addition to this, spoken Lhasa Tibetan uses four distinct relative’ ~ 
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markers, such as mkhan, sa, yag and pa and out of them which marker 
should be chosen is determined by the semantic role of the head NP in 
the relative clause, and by the time reference of the relative clause 
(DeLancey, 1986). For the languages using only one relative marker as 
in Manipuri cannot make differences as the Lhasa Tibetan relative 
markers do in the relative clause. Therefore, Manipuri follows the 
criterion as the head NP and anyone of non-head NPs in the relative 
clause must not refer to the same entity. The cas¢ markers which the 
non-head NPs carry determine their functions, so enable us to decide 
upon the semantic role of the head NP in the relative clause. In a 
situation where either NP could function as an actor or patient and 
even case marking is not obligatory for the actor-patient distinction, the 
head NP acts as an actor and the non-head NP as a patient in the 
relative clause. Consider the following examples. 


40) a. mi hatpe kay 
mi hat-pa kay 
person kill-NZR_ tiger 
‘The tiger that killed a man’ 


b.kay = hatpa mi 
kay _hat-pa mi 
tiger kill-NZR person 
_ ‘The man who killed a tiger’ 


On the other hand, when a non-head NP in the relative clause 
functions as an actor, it is obligatorily marked with nominative case as 
in (41) below. 


41) kayna hatpa mi 
kay-no hat-pa mi 
tiger-NOM kill-NZR_ person 
‘The man whom a tiger killed’ 


An interesting point we could add here 1s the use of -pi ‘female 
aominalizer’ as a relativizer remains as an option against the use of -pa 
nominalizer’ if the relativized head noun is a female. For instance: 


42) [mawa hatpiJpc nupiduyy, — pulisna phare 
méa-wa hat-pi — nupi-tu pulis-na phd-le 


3p-husband kill-NZRF woman-DET police-NOM §arrest-PERF 
‘The police have arrested the woman who killed her husband.’ 
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However, the use of -pi is not obligatory as it could be replaced 
with -pa. 


9.0 Ambiguous construction: nominalization vs relativization 


43) a. [tombana lairik piba/pc  thoibanyy = khandeyg cy. 
tombba-na laink pi-pa thoiba khan-ta-t 
Tomba-NOM book give-NZR Thoiba  know-NEG-RL 
‘(I) don’t know Thoiba whom Tomba gave a book.’ 


b. [tombana lairik pibaJjp  thoiba_ khandey cy, 
tomba-na laink  pi-pa thoiba  khan-ta-t 
Tomba-NOM book give-NZR Thoiba know-NEG-RL 
‘Thoiba didn’t know that Tomba gave a book (to x).’ 


There is an internal compositional distinction between 
relativization and nominalization even though both can be structurally 
assigned to NPs as shown below. 


Relativization: [[Relative clause]NP,,] > NP 
Nominalization: verb/clause 5p + NP 


10.0 Finite Verb Nominalization 


When a finite verb is nominalized with -pa, one can access to the 
hidden additional information such as contrary to expectation in (44), 
firm statement in (45), seeking confirmation in (46 & 47), etc. 


44)nen mnaran skul  catleba 
nan naran skul _ cat-le-pa 
you yesterday school go-PERF-NZR 


‘(Do you mean it) you went to school yesterday.’ 


45) ay hayen sdidana lakkaniba 
oy hayen sol-ta-na lak-kani-pa 
I tomorrow be erroneous-NEG-ADV come-UNRL-NZR 


‘(Don’t worry) I will definitely come tomorrow.’ 
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46) non — cak careba 
non cak ca-le-pa 
you cookedrice eat-PERF-NZR 


‘(Are you sure) you have taken your meal?” 


47) tombadu naran aykhoida lakpraba 
tombo-tu naran al-khot-ta lak-pa-ld-pa 
Tomba-DET yesterday I-HU.PL-LOC come-NZR-INT-NZR 
‘(I am asking you) whether Tomba did come to our 


home yesterday.’ 


11.0 Attributive Adjectives 


Adjectives in Manipuri are formed by the addition of attributive 
é- and nominalizer -pa to the adjectival verbs while in Rawang 
‘adjectives take the nominalizer we in citation, but when used as 
predicates function the same as other intransitive verbs’ (LaPolla, 
2006). The prefixation to the adjectival verbs depends on the conditions 
that the verb must not take any other derivational suffixes and any 
lexical/grammatical element capable of further modifying the adjective 
or the whole noun phrase. For example, we cannot have the forms such 
as *a-pha-ra-ba, *yamna a-pha-ba, etc. However, once a- 1s deleted the 
given forms become quite grammatical and acceptable as pha-ra-ba 
ayan ‘the boy who has become good’ and yamna pho-ba anan ‘a very 
good boy’ (See Imoba, S.S. & Sarbayit, L.S., 2002). The discussion here 
focuses on the assumption of grammatical legitimacy of adjectives in 
Manipuri are nominals modifying nouns. Let us consider the following: 


48) a. aphaba mi ani thabak khalle 
d-pha-pa mi q-nl thabak  khal-le 
Att-be good-NZR person Att-two job appoint-PERF 


‘Two competent candidates have been appointed.’ 


b. aphaba ; ant thabak khalle 

aphéps —aeni._—sthabak  khal-le 
Att-be good-NZR_ Att-two job appoint-PERF 
‘Two competent candidates have been appointed.’ 
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In (48b) the nominal attribute can stand alone without its head 
noun. In such a situation, the sensible meaning is captured from the 
context of use. There are other instances where the so-called adjectives 
emerge as stand alone nouns functioning as arguments, which, in turn, 
could be considered as evidences for our assumption that adjectives in 
~ Manipuri are nominal attributes. 


49) a. oy acumba pammi 
oy d-cum-pa pam-i 
I Att-be true-NZR_ like-RL 
‘T love truth’ 


b. akanbana acumba Saki 
d-kan-ba-na a-cum-ba Sak-1 
Att-be strong-NZR-NOM Att-be true-NZR  sing-RL 
‘Might is right.’ 

c. awadba khondaba moca 
d-wd-ba khon-ta-pa mo-ca 


Att-be sad-NZR know-NEG-NZR NM-son/daughter 
‘a person/child who does not know what sufferings are (in life)’ 


Also note that the attributive prefix o- can function as a 
nominalizer even without -pa. However, the construction definitely 
results into a compound noun consisting of at least two verb stems as in: 


50) a. awa ana 

d-wa d-na 
Att-be sad Att-feel pain 
‘sufferings’ 

b. ain asa 
a-1 9-Sa 
Att-be cold Att-be hot 
‘weather/climate’ 

c. acum aran 
a-cum d-ran 


Att-be correct Att-be wrong 


‘justice’ 
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d. athum ohao 
d-thum d-hao 
Att-be tasty Att-be sweet 


‘varieties of sweetmeat’ 


12.0 Conclusion 


After the investigation into the different aspects of -pa 
constructions, we have arrived at this point that -po fulfils a vast number 
of grammatical functions. As Chantyal -wa ‘nominalizer’ -pa is a single 
grammatical entity regardless of its many uses. In this language, at least, 
nominalization cannot be considered merely as a process of forming a 
noun from a non-nominal element. The other important grammatical 
processes such as subordination, relativization, the formation of 
attributes, etc. are all seen as the different functions of nominalization. 


Abbreviations 

3p Third person ABL Ablative 

ADV Adverb marker ASSO Associative 

Att Attributive DAT Dative 

DET Determiner DLMT Delimitative 

GEN Genitive HPL Human plural 
M.CL Main clause NEG Negative 

NH Head noun NOM Nominative 

NZR Nominalizer NZRF Female Nominalizer 
PERF Perfect PL Plural 

POT Potential PROH Prohibitive 

PROS Prospective RC Relative clause 
RL Realized aspect UNRL Unrealized aspect 
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WORD FORMATION IN DIMASA 


AJIT KUMAR BAISHYA 
Assam University, Silchar 


1. Introduction 


Many linguists consider word formation processes to be the most 
important aspect of linguistic study of any language. Word formation 1s 
one of the major issues in morphology and it deals with the construction 
of new but related words. It is concerned with those words that consist 
of more than one morpheme. So, by the additions of certain affixes to 
the root, we can derive different classes of words to convey the 
intended idea. Of course, it is a rule-governed behaviour and it follows 
the rules of the particular language. According to Crystal (1985), the 
term word formation refers to the whole process of morphological 
variation in the constitution of words, i.e. including the two main 
divisions of Inflection and Derivation. In a more restricted sense, word 
formation refers to the latter processes only. Natural languages like 
Dimasa are found to use two main types of strategies, mainly affixation 
and compounding in order to derive a new word. At the same time, 
other processes like borrowing, reduplication are also used in Dimasa 
to form new words to meet the ever-growing needs of the Dimasa native 
speakers in their day-to-day life. 


2. The Language 


Dimasa is the name of the community as well as the name of the 
language, which is spoken by a group of people belonging to the 
Mongoloid race. Linguistically, Dimasa belongs to the Bodo-Garo 
group of the Tibeto-Burman family of languages, said Benedict (1972). 
They are also known as Hill Kachari or Dimasa Kachari (Grierson, 
1903). Dimasa language is spoken mainly in the North Cachar Hills of 
Assam. The Dimasas are also found in Cachar, Karbi Anglong and 
Nowgaon districts of Assam and in Dimapur sub-division of Nagaland. ~~.’ 
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According to 1991 census, the total population of the Dimasa speakers 
in Assam is 1,65,184. 


The word Dimasa literally means ‘children of the great river’ and 
the Ahom, the erstwhile rulers of Assam, called them Timisa, which is 
clearly a corrupted version of the word Dimasa. It is believed that 
Dimasas ruled Assam for almost 600 years prior to the advent of the 
Ahoms to Assam. But the Ahoms who came to Assam from present day 
Thailand under the leadership of Sukapha in A.D. 1228 defeated the 
Dimasas and as a result of it, the Dimasas shifted their kingdom to 
different places in the North East, which were not under Ahoms rules 
at that point of time. The historical chronicles of Assam mentioned that 
the Ahom army defeated the Dimasas and destroyed their capital city 
Dimapur as well as killed the Dimasa King Dersong in A.D. 1536. To 
avoid further humiliation at the hands of the Ahoms, the Dimasas 
shifted their capital to Maibong in North Cachar Hills of Assam and 
later it was shifted to Khaspur near Silchar, Assam. These historical 
events clearly indicate that the Dimasas came into close contact with 
different groups of people at different points of time and the contact is 
maintained till today as the Dimasas are living peacefully with the 
Bengalis in Barak Valley and with Assamese and other communities in 
other parts of Assam. 


The Dimasas are basically born bilinguals. Besides their mother 
tongue, the Dimasas of Cachar district situated in Barak Valley speak 
fluent Sylheti, a regional dialect of Bengali as it is the language variety 
spoken by the majority and the Dimasas living in other parts of Assam 
speak Assamese. Since all the Dimasa speakers are bilinguals and 
Dimasa, which is a minority language in the North East, exists side by 
side with Assamese and Sylheti Bengali in Assam, it is quite natural that 
the majority languages like Assamese and Bengali will have some 
influences on word formation processes in Dimasa. The influence can 
be seen especially in the area of borrowing. 


3. Affixation 


It is a collective term for the types of formative that can be used 
only when added to another morpheme (root or stem), 1c. affixes are a 
type of ‘bound’ morpheme (Crystal, 1985). Affixation is a very 
productive morphological process as far as word formation in Dimasa 
~ is-concerned. Though three types of affixes viz. Prefix, Infix and Suffix 
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are found in usage in different languages of the World, Dimasa makes 
use of only two - Prefix and Suffix. 


3.1 Prefixation 


Dimasa uses a number of prefixes like /da-/, /a:-/, /g-/ to derive 
different word classes. They are explained below with the help of 
examples. 


/da-/ when this prefix is attached to the verbal root, it shows 
prohibition as well as negation: 


/ji/ ‘eat’ /da + ji/ ‘don’t eat’ 
/mlao/ ‘play’ /da+mlao/ ‘don’t play 
/neh/ ‘pull’ /da + neh/ ‘don’t pull’ 
/dugur/ ‘bath’ /da+dugur/ ‘don’t bath’ 
/rep/ ‘write’ /da + rep/ ‘don’t write’ : 


/a:-/ acts as the first person singular as in the following: 


/ma:/ ‘mother’ /a: + ma:/ ‘my mother’ 
/zu/ ‘grandfather’ = /a: + zu/ ‘my grandfather’ 
/bi/ ‘sister’ /a: + bi/ ‘my sister’ 


Adjectives in Dimasa are derived from verb by prefixing /g-/ as 
exemplified below: 


/ja:m/ ‘finish’ /g+ja:m/ ‘old’ 
/de/ ‘win’ /g+ de/ ‘big’ 


3.2 Suffixation 


Dimasa language uses quite a large number of suffixes in order 
to create new word classes. In fact, Dimasa uses more suffixes than 
prefixes. Few of the suffixes are mentioned below with ta help of 
examples. 


Suffix /-du/ is added to the verb in Dimasa to make it a 
continuous one to indicate an ongoing process like the following: 


/ji/ ‘eat’ /ji + du/ ‘eating’ 
/mlao/ ‘play’ /mlao+du/ ‘playing’ 
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plural. 


/bir/ ‘fly’ _/bir + du/ ‘flying’ 
/lem/ ‘throw’ /lem + du/ ‘throwing’ 
/neh/ ‘pull’ /neh + du/ ‘pulling’ 


Suffix /-ka:/ functions as the past tense marker in this language. 


/ji/ ‘eat’ [jt + ka:/ ‘ate’ 
/mram/ ‘smell’ /mram+ka:/ ‘smelt’ 
/di/ ‘carry’ /di + ka:/ ‘carried’ 


Suffix /-rao/ is added to a human noun in Dimasa to make it a 


/ansa/— ‘child’ /ansat+rao/ ‘children’ 
/mala/ ‘damsel . /mala+rao/  ‘damsels’ 
/naga/ ‘bachelor’ /naga+rao/ ‘bachelors’ 


¢ ? 


/miya:/ ‘man 


> 


/miya:+rao/ ‘men 


Suffix /-ya:/ acts as a negative marker in Dimasa. It can be added 


to the adjectives as well as to the verbs like the following. 


/kasi/ ‘small’ /kasi + ya:/ ‘not small’ 
/sim/ ‘black’ /sim + ya:/ ‘not black’ 
ful ‘high’ [ju + ya:/ ‘not high’ 
/ham/ ‘good’ /ham + ya:/ ‘not good’ 
/nu/ ‘See /nu + ya:/ ‘not see’ 
/kna/ ‘hear’ /kna + ya:/ ‘not hear’ 
/jon/ ‘walt’ jon +ya:]/ ‘not wait’ 
/kam/ ‘sit /kam + ya:/ ‘not sit’ 


Suffix /-ya:/ is also used in Dimasa as an agentative suffix to 


derive an agentive noun from a verb. 


/bai/ ‘dance’ /bai + ya:/ ‘dancer’ 
/brai/ . ‘buy’ ) /brai + ya:/ ‘buyer’ 
/dim/ ‘protect’ /dim +ya:/ ‘protector’ 
jjuru/ ‘help’ jurut+ya:/ ‘helper’ 


/dothai/ = ‘kill’ /dothai+ya:/ ‘killer’ 
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4. Compounding 


Compounding is a word formation process in which a linguistic 
unit is formed with at least two roots. It basically means that complex 
words are formed from smaller units which otherwise can function 
independently. Several types of compounds are found in Dimasa. A few 
of them are mentioned below along with examples. 


4.1 Noun+ Noun 


/bon/ ‘tree’ /pan/ ‘plant’ /bon + pan/ ‘tree’ 
/mai/ ‘paddy = /ron/ ~— ‘seed’ /mai +ron/ ‘rice’ 
/bar/ ‘air’ /di/ ‘water’ /bar + di/ ‘climate’ 


/gakran/ ‘blue’ /gisim/ ‘black’ /gakran + gisim/ ‘deep blue’ 


/sain/ ‘sun’ /bar/ ‘flower’ /sain + bar/ ‘sunflower’ 
/dao/ ‘bird’ /di/ ‘egg’ /dao + di/ ‘bird’s egg’ 
/ta/ ‘under- —_/tat/ ‘fruit’ /ta + tai/ ‘potato’ 
ground 
root 
4.2 Noun + Adjectives 
/ha/ ‘sol’ /zu/ ‘high’ /ha + zu/ ‘hill’ 
/mel/ ‘meeting’ /ma/ ‘big /mel+ma/ ‘conference’ 
/ri/ ‘cloth’ /sa/ ‘small’ /ri+ sa/ ‘towel’ 
4.3 Verb+ Noun 
/rep/ ‘write’ /hoza/ ‘leader’ = =/rep+hoza/ ‘editor’ 
/pori| ‘read’ /dnsa/ ‘boy /pon'+ ansa/ ‘student’ 


5. Borrowing 


Borrowing takes place when a speech community comes into 
close contact with other communities and as a result of this contact, one 
community starts taking and using some linguistic items from the other 
communities. Borrowing not only depends on contact but also on ° 
bilingualism. A large number of bilinguals ina particular speech 
community make borrowing very common and all pervasive. The 
borrowing always moves from the direction of the majority to minority 
language. In other words, the borrowing becomes one sided. It happens _ 
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either because of the prestige factor that is usually associated with a 
dominant language or it can be a need-fulfilling exercise. 


Dimasa, being a Tibeto-Burman language, has vocabulary largely 
of the Tibeto-Burman origin. But it has borrowed a lot of words from 
other languages as the Dimasa speakers came into contact with these 
groups of people as we mentioned above. Dimasa has played the role of 
a recipient language in Assam. Dimasa has borrowed innumerable 
lexical items from different languages covering almost all the semantic 
fields. Assamese and Bengali donated more words to Dimasa than any 
other language and this kind of borrowing is called Jntimate Borrowing 
by Bloomfield (1935). Phonological borrowing also took place in 
Dimasa. For example, Dimasa does not have aspirated stops, but the 
inclusion of borrowed lexical items having aspirated sounds from 
Assamese and Bengali into the Dimasa system has resulted in the 
acceptance of thése borrowed sounds as their own native sounds. Of 
course, it is not wide spread, only a handful of them are borrowed. 
Besides the development of aspirated stops in Dimasa as part of 
phonological borrowing, a large number of Assamese-Bengali words 
are borrowed and they are gradually replacing the original Dimasa 
words such as the following: 


Dimasa Assamese/ Bengali 

/grao/ /bha:sa:/ ‘language’ 
/rinmili/ /sonskniti/ ‘culture’ 
/repgon/ /kolom/ ‘pen’ 
/bubli/ /somot/ ‘time’ 
/risa/ /gamosa/ ‘towel’ 
/laizama/ /sitht/ ‘letter’ 


Apart from the above-mentioned examples, we also got some 
Indo-Aryan, to be more specific, Assamese-Bengali words, which have 
altogether displaced their original, native Dimasa counterparts and as a 
result of it, no Dimasa native speakers irrespective of their age, sex and 
profession could give us the original native words for the following: 


. Assamese/Bengali 
/bostu/ “goods, things’ 
/katari/ ‘knife’ 
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/sola/ ‘shirt’ 

/mel/ ‘meeting’ 
/koina/ ‘bride’ 
/khoros/ ‘expenditure’ 
/andar/ ‘dark’ 

/sini/ ‘sugar’ 
/ka:ron/ ‘reason’ 
/kobi/ ‘cabbage’ 
/saza/ ‘punishment’ 
/hisap/ ‘accounts’ 


It means that Dimasa has lost the native terms for the 
above-mentioned lexical items forever and we should not forget that 
this list is not exhaustive. Shall we then call it the first sign of language 
decay as far as this particular language is concerned? Only time will tell 
us. 


5.1 Assamese loan words 


The Dimasas came into contact with the Assamese speakers 
during the Ahom regime as we mentioned earlier and hence many 
Assamese words have entered the Dimasa vocabulary. The borrowed 
words have become an integral part of the Dimasa lexicon. 


Dimasa Assamese 

/katari/ /kotari/ ‘knife’ 

/sola/ /sola/ ‘shirt’ 

/gaker/ /gakhir/ ‘milk’ 

/mekla: sador/ — /mekhela: sador/ ‘a kind of dress worn by ladies’ 
/pita:/ /pitha:/ ‘sweets made from rice powder’ 
/khobor/ /khobor/ ‘news’ 

/ron/ /ron/ ‘colour’ 


5.2 Bengali loan words 


The Dimasas came into close contact with the Bengalis after they 
shifted their capital first to Maibong, North Cachar Hills and then to. 
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Khaspur, which is situated near Silchar, the Head quarter of Cachar 
district. The Bengali Brahmins started exerting considerable influence 
in all the affairs of the Dimasa state at Khaspur, when Bengali became 
the official language of the Dimasa state. Cachar district is heavily 
dominated by the Bengalis and Sylheti, a dialect of Bengali is the most 
common mode of communication in this region. As a result of this 
contact with the Bengalis, a huge stock of Bengali words entered the 
Dimasa lexicon. 


Dimasa Bengali 
/ba:ga:n/ /ba:ga:n/ ‘garden’ 
/a:r/ /a:r/ ‘and’ 

/sima:/ /sima:/ ‘boundary’ 
/ka:na:/ /ka:na:/ ‘blind’ 
/phoila:/ /poila:/ ‘first’ 

/zinis/ /zints/ ‘things, goods’ 
/ita:/ /ita:/ ‘brick’ 


53 Perso-Arabic loan words 


The available historical records of Dimasas did not mention any 
kind of contact with the Arabic language and speakers. Though few 
words of Perso-Arabic origin are found in Dimasa lexicon, it seems that 
Dimasa did not directly borrow those words but they entered Dimasa 
vocabulary through Assamese or Bengall. 


/dukan/ ‘shop’ 
/ukil/ ‘lawyer’ 
/jila:/ ‘district’ 
/b onduk/ ‘oun’ 


5.4 European loan words 


Like any other languages of the world, Dimasa has also 
borrowed a large number of European words, especially, English words 
to fulfil the various needs necessitated by industrialization, urbanization 
and modernization. Education also plays a major role for the borrowing 
to take place in Dimasa language. So, words related to the field of 
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industry, education, modern and urban lives were heavily borrowed 
from English and these words became an integral part of the Dimasa 
lexicons. Few words are mentioned below. Of course, the English words 
are modified to suit che phonological system of Dimasa language: 


/kompani/ ‘company’ /botol/ ‘bottle’ 
/philim/ ‘film’ /pensil/ . ‘pencil’ 
/mesin/ ‘machine’ /seken/ ‘second’ 
/tibhi/ ‘television’ /pamsa:r/ ‘puncture’ 
/iskul/ ‘school’ /tektor/ ‘tractor’ 
/kolej/ ‘college’ /mastor/ ‘master’ etc. 


The above-mentioned loanwords are not pronounced by the 
Dimasas in the original way. They have modified the pronunciations to 
suit the phonological and morphological system of the Dimasa 
language. In other words, the borrowed lexical items are nativized. 


6. Reduplication 


Reduplication stands for repletion of all or a part of a lexical 
item carrying a semantic modification. Reduplication, thus, can be 
partial or complete (Abbi, 1992). So, it is a process of repetition 
whereby the root word is repeated either completely or partially. 
Reduplication in Dimasa can be classified into two major types, viz., 
complete reduplication and partial reduplication. In complete 
reduplication, the whole form is repeated without any change whereas 
in partial reduplication, an expression is repeated partially. 


6.1 Complete Reduplication 


/sonte + sonte/ ‘short’ 
/gidi + gidi/ ‘big’ 
/kase + kase/ ‘small’ 
/lailo + lailo/ ‘easily’ 
/kere + kere/ ‘slowly’ 
/prik + prik/ ‘silently 


/gibin + gibin/ ‘differently’ 
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6.2 Partial Reduplication 


/bale + le/ ‘thinly’ 
/mili + i/ ‘smoothly’ 
/stepble + ble/ ‘sticky’ 
/rejente + te/ ‘lightly 


7. Echo Formations 


An echo word has been defined as a partially repeated form of 
the base word - partially in the sense that either the initial phoneme 
(which can either be a consonant or vowel) or the syllable of the base is 
replaced by another phoneme or another syllable (Abbi 1992). The 
echo word has no meaning on its own and the whole construction 
expresses the meaning of ‘etcetera’, ‘things similar to’ or ‘associated 
with that’ (Abbi 1992) to the base word or first word. 


/mairon/ ~  /mairon mairan/ ‘rice etc.’ 

/muli/ ~  /muli mula/ ‘medicine etc.’ 

/laisi/ ~  /laisi laisa/ ‘book etc.’ 

/guru/ ~  /guru gara/ ‘sugarcane etc.’ 

/ju/ ~ fjuja/ ‘rice beer and things similar to’ 
/di/ ~  /di da/ ‘water etc.’ 

/sa/ ~ /sasu/ ‘tea etc.’ 

/na/ ~ /nanu/ ‘fish etc.’ 

/nesa/ ~  /nesa nesu/ ‘intoxication etc.’ 

/ita/ ~~ fila itt ‘brick and things similar to’ 
/tas/ ~  /tas tus/ ‘playing card etc.’ 


The above-mentioned examples clearly indicate the rules of echo 
formation in Dimasa. If the base word contains /a/ in the final syllable, 
then the /a/ will be replaced by /u/ in the echo word. If it is any other 
vowel in the base word, it will be replaced by /a/. 


8. Conclusion 


From the above discussion on word formation in Dimasa, it has 
been seen that Dimasa makes use of all the known techniques like 
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affixation, compounding, reduplication etc. found in South Asian 
languages for deriving new words. The present study shows that 
Dimasas, being the speakers of a minority language in Assam, are 
always in pressure to maintain their mother tongue. As a result of it, 
Dimasas are trying to use the language in all the domains despite the 
pressure exerted on it by the Assamese and Bengali language. For this 
purpose, they need a large number of lexical items related to different 
semantic fields. In order to make it possible or to fill up the gaps 
Dimasas are borrowing a lot of words from different languages 
belonging to the other language families. Though Dimasa has retained 
most of the Tibeto-Burman features but at the same time, it has also 
gained many phonological, morphological and syntactic features from 
Assamese and Bengali languages. Apart from this, Dimasa also uses 
other techniques as mentioned above to fulfil the different types of 
needs felt by the speakers at various point of time so that the native 
speakers can use the Dimasa language for all practical purposes. In 
other words, Dimasa language employs all the known techniques to 
form new words so that it can effectively be used for all the domains of 
day-to-day usage. 
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ABSTRACT 


Multiword expressions (MWEs) create serious problems for many 
areas of language technology namely, language abstraction, query 
answering, machine translation, information retrieval etc. They are 
generated in various ways and their interpretations are different for 
different languages. MWEs include nominal compounds, phrasal verbs, 
idioms and collocations, as well as some other less easily classified 
examples. In this paper we deal with the analysis of multiword expressions 
in Bangla. Though the problem is quite profound, no study has been done 
on Bangla MWEs except for isolated works on re-duplicated words, echo 
words, onomatopoetic words etc. Even for these cases, no computer-based 
analysis has been reported so far. Here we. are concerned with 
characterizing MWEs and detecting them within a Bangla sentence. For 
this purpose, at first we have collected different types of MWE and then 
tried to find general rules of categorizing as well as identifying them. This 
categonzation can lead to their primary parsing, which in tum, may be 
used in more sophisticated language technology problems. 


1. Introduction 


Multiword expressions (MWEs) are acknowledged to be a 
serious problem in many areas of language technology. If several 
consecutive words in a sentence behave like a unified syntacto-semantic 
entity, they together form a Multiword Expression (MWE). In other 
words, MWE is a subset of cohesive lexemes that cross single word 
boundaries. This peculiar behaviour makes standard methodologies of 
computer-based natural language processing (NLP) inadequate. for 
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MWE processing. In Jackendoffs view (Jackendoff, 1997), the 
magnitude of this problem is far greater than has traditionally been 
realized, since the number of MWEs in a speaker or writer’s lexicon is 
of the same order of magnitude as those of single words. For example, 
in English WordNet 1.7 (Fellbaum, 1998), 41% of the entries are of 
multiword category. For a wide coverage of linguistic constructs, this is 
most likely an underestimate. 


Since MWEs are generated in various ways and_ their 
interpretations are different for different languages, a_ special 
component of language-specific processing is needed in problems like 
language abstraction, query answering, machine _ translation, 
information retrieval etc. MWEs include nominal compounds, phrasal 
verbs, idioms and collocation, but also less easily classified examples 
such as ‘by and large’, ‘ad hoc’, ‘in line’, ‘such as’ etc. Some of them are 
completely fixed, e.g. ‘ad hoc’, while some, like ‘tiffin box’, which is 
formed by combining two nouns, can be more flexible. 


The first difficulty with MWEs is that if they are treated by 
general compositional method of linguistic analysis, there is an 
over-generation problem. The second difficulty is the so-called 
‘idiomacity problem in semantic domain. On the other hand, 
syntactically idiomatic MWEs also exist in the language. They lead to 
parsing difficulty, due to non-conformance with patterns of word 
combination, as predicted by the grammar. 


An appreciable amount of work has been done on the analysis of 
MWEs in European Languages for the development of a better 
language model for various practical applications. However, since early 
nineties, researchers like Nunberg (Nunberg, 1994), Copestake 
(Copestake, 2002), Jackendoff (Jackendoff, 1997) are trying to make 
better understanding of the diverse kinds of MWEs. Sag et al. (Sag, 
2002) perhaps formed the most influential linguistic treatment of such 
structures, who coined the terms institutionalised phrases (items with 
high co-occurrence frequency) and lexicalized phrases. Such division of 
MWE:s appears to be universal over languages. Lexicalized phrases can 
have partially idiosyncratic syntax or semantics or contain "words" 
which do not occur in isolation, such as ‘ad hoc’. Institutionalised 
phrases are syntactically and semantically compositional, but occur in 
high frequency in the language corpus. An example of this category is 
‘telephone booth’. In general, it is not always possible to make a clear 
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distinction between compositionality and non-compositionality, but one 
can think of a gradation of different levels between the two extremes. 


The lexicalized phrases are again subdivided into three groups: 
fixed expressions, semi-fixed expressions and syntactically flexible 
expressions. On the other hand, Baldwin et al. (2003) have divided the 
MWEs into three semantic classes: non-decomposable, idiosyncratically 
decomposable and simple decomposable MWEs. They are explained in 
section 2.3 of this paper. 


Different statistical association/co-occurrence measures have 
been suggested in literature for the identification of MWEs. Some of 
those are Mutual Information (Church and Hanks _ 1989), 
Log-Likelihood (Dunning, 1993), cohesiveness (Dias, 2003) etc. Dias 
presented an automatic approach for extracting ‘multiword units’ that is 
based on cohesiveness between words and parts of speech (POS) tags. 
Piao et al. (Piao, 2003) seemed to accept MWEs according to the 
co-occurrence frequency. Integration of all these statistical measures 
should provide better evidence for recognition MWEs. Recently, 
Venkatapathy and Joshi (Venkatapathy and Joshi, 2005) tried to solve 
the problem of noun-verb collocational expressions in Hindi using 
Latent Semantic Analysis. Despite these attempts, MWE identification 
is still unresolved as a fully automatic NLP task. 


This paper offers an analysis of Bangla multiword expressions. 
Bangla is one of the major Indian languages having a wide variety of 
MWEs. Though the problem is profound, no elaborate study has been 
done on this language. Some studies on Bangla word formation are 
found in the traditional grammar book like Bhasa-Prakash Bangla 
Byakaran by Suniti Kumar Chatterji (Chatterji, 1939). Some isolated 
research on Bangla idioms (Bhattacharya, 2005), echo words or 
hyphenated words (Bhattacharya, 2006) can be found, which are not 
enough for Bangla multiword analysis in the modern context. Some 
references of additional studies are: analysis of reduplicated words 
(Datta Majumder, 2001); analysis of compound verb formation (Paul, 
2004), noun-verb compositional study (Ghosh, 2006), etc. 


The present paper is an attempt to identify the possible 
categories of Bangla multiword expressions automatically by computer. - 
Here section 2 deals with the general categorization of MWEs while 
section 3 provides a detailed description of Bangla multiword 
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expressions. Section 4 is concerned with an algorithm for processing the 
MWEs in a Bangla sentence. Section 5 describes some experimental 
results on relative abundance of MWEs in a medium-scale Bangla 
language corpus. Also, automatic identification of MWEs has been 
tested on six different text files having about 20,000 words. More 
elaborate software of this nature may be useful in machine translation, 
information retrieval and query answering, among other problems. 
Section 6 concludes the paper. 


2. General Categories of Multiword Expression 


The classification of English MWEs is not unique because of 
difference in the opinion of the experts. In the diagram of Figure 1, a 
possible classification is shown, whose components will be clarified in 
the subsequent discussions. 


Multiword Expression 
Lexicalized Phrase Institutionalised Phrase 
Fixed Semi-fixed Syntactically-flexible 
Expression Expression Expression 


Figure 1: A General structure of MWEs 


2.1. Lexicalized Phrase: According to Bauer (Bauer, 1983), lexicalized 
phrases are the MWEs having at least partially idiosyncratic syntax or 
semantics. The simplest cases may be looked as words with spaces, but 
in this category we can include also the semantically idiosyncratic 
compound noun and _ adjective-noun combinations, idioms, 
verb-particle combinations, and other related constructions. We can 
group them into three sub-categories, as follows. 


2.1.1. Fixed Expressions: There is a large class of immutable expressions 
in English that defy conventions of grammar and compositional 
interpretation. These are called fixed expressions. Such types of 
expressions are fully lexicalized and undergo neither morpho-syntactic 
variation nor internal modification. Some examples are: ‘by and large’, 
‘ad hoc’, ‘in between’, etc. 
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This type of expressions is not unknown to Bangla. Example may 
be cited as: moTer upar (‘by and large’), balA bAhullya (‘needless to 
say), kAje kAjei (‘henceforth’), dharA jAk (‘suppose’), jAi hok 
(‘whatever may be’) etc. 


2.1.2. Semi-Fixed Expressions: Semi-fixed expressions undergo some 
degree of lexical variation in the form of inflection, variation in reflexive 
form and determiner selection. They generally include categories like 
non-decomposable idioms (‘shoot the breeze’), certain compound 
nominal (‘attorney general’) and some proper names (‘San Francisco’). 
Two types of semi-fixed expressions are found in English: one, where 
the components remain fixed, whereas in the other, some components 
may be changed or inserted. 


In Bangla, sometimes components are inserted within a 
reduplicated chunk. Some examples are: jAk bA nA jAk, halei*bA ki nA 
halei bA ki. Here, some part of the expressions remain fixed, whereas, 
additional words are inserted within the fixed part. 


2.1.3. Syntactically-Flexible Expressions: Syntactically-flexible expressions 
exhibit a much wider range of flexibility within itself than those of the 
semi-fixed expressions. In English, generally three types of 
constructions come under this group. 


(1) Verb particle constructions: This type of construction consists of a 
verb and one or more particles. They can either be semantically 
idiosyncratic like ‘brush up’, ‘put off or compositional ie., the use 
of verb particle such as ‘break up’ within the ‘sentence meteorite 
broke up in the earth’s atmosphere’. 


(2) Decomposable Idioms: Sometimes idioms like ‘let the cat out of the 
bag’, which are decomposable in nature, are syntactically flexible to 
some degree. Such idioms are incompatible with words-with-spaces 
Strategy. 


Bangla is not void of such type of constructions. Sometimes, 
idiom-like constructions (specially verbal in nature) such as hATe 
hARi bhAngA, paTol tolA may be flexible in nature. It is difficult to 
determine what type of syntactic variation a given idiom can 
undergo. 


(3) Light Verb-Noun Combination: In this kind of construction, there 
are one verbal element and one nominal element. For example, 
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‘make a mistake’, ‘give a demo’ etc. Like the decomposable idioms, 
such constructions are highly idiosyncratic i.e., it is very difficult to 
predict which light verb combines with a given noun. 


This type of noun-verb collocation is a common feature of Bangla 
language (biye karA, mukta karA). In Bangla, such constructions are 
called composite verbs. But in this language such constructions are 
not flexible in nature (*mukta bhAbe karA is unacceptable), on the 
other hand, mukta mane kAj karA is a grammatical sentence but the 
sense is changed and here the N-V element is KAj karA. 


2.2. Institutionalised Phrase 


Bauer has described another class of English MWEs, viz. 
Institutionalised Phrase. Such phrases are semantically and syntactically 
compositional, but statistically idiosyncratic. Here generally two simple 
senses combine to produce a compositional sense and this composite 
sense has been conventionalised. Examples are ‘traffic light’, ‘telephone 
booth’, ‘kindle excitement’. Being compositional, they often undergo 
full syntactic variability. 


In case of Bangla, such phrases can be put under the name 
Productive Compound Nouns. They are not idiomatic in nature. For 
example, bhAjJA is a component which makes compounds like mAch 
bhAJjA, beguna bhAjA, but *mAnsa bhAjA is not so commonly used. 
Examples of this group may be durgA pujo, selai skul, and etc. It can be 
comparable to aluk-samasa in Bangla, where the case endings are 
retained with the first word, unlike the typical bangla samasa. More 
examples are Age-Piche (‘to and fro’), tele bhAjA (‘fried food’), khelAr 
math (‘playground’), mAmAr bAn (‘maternal house’) etc. 


2.3. Classification based on Compositionality 


Again, in terms of semantic interpretation and compositionality 
(Baldwin et al. 2003) English MWEs can be divided in certain 
categories, as follows: 


Multiword Expression 
Non-decomposable Idiosyncratically Simple 


Figure 2: Semantic structure of the MWEs 
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Non-decomposable MWEs: They (e.g. ‘kick the bucket’, ‘shoot the 
breeze’, ‘hot dog’) are non-decomposable in the sense that no 
decompositional analysis is possible, and the MWE is semantically 
unsolvable in the form of internal modification (* ‘kick the great bucket 
in the sky’) or passivization (* ‘the breeze was shot’). The only syntactic 
variation that non-decomposable MWEs can undergo is_ verbal 
inflection (e.g. ‘kicked the bucket’, ‘kicks the bucket’) and pronominal 
reflexivisation (e.g. ‘wet oneself, ‘wet themselves’). Non-decomposable 
idioms are unable to capture the effects of inflectional variation in 
reflexive form. 


Idiosyncratically decomposable: Such MWEs (e.g. ‘spill the beans’, ‘let 
the cat out of the bag’, ‘radar footprint’) are decomposable but their 
parts show semantic variability to some extent. They undergo a certain 
degree of syntactic variation (e.g. ‘the cat was let out of the bag’). 
Nunberg et al. (1994) observed that idiosyncratically decomposable 
MWEs undergo much greater syntactic variation than non-decomposable 
MWEs, and that the variability can be partially predicted from the 
decompositional analysis. rs 


Simple decomposable: Institutionalised MWEs decompose into simple 
senses and generally display high syntactic variability. What makes 
simple decomposable expressions true MWEs rather than productive 
word combinations are that they tend to block compositional alternates 
with the expected semantics (termed as anti-collocations by Pearce 
(2001)). For example, ‘motor car’ cannot be rephrased as ‘engine car’ 
or ‘motor automobile’. The existence of anti-collocations is a test for 
analyzing decompositionality. 


3. Types Of Bangla MWEs 


For Bangla we have made somewhat different classification of 
MWEs. It is illustrated by the tree structure of Figure 3. The 
elaborations of these classes are given in the following subsections. 


3.1. Repetitive Bangla MWEs 


When the same surface word-form appears consecutively at least 
twice in a sentence, we call it a repetitive MWE. More than 80% 
Bangla MWEs are repetitive words. Repetition of the same word twice 
is called reduplicated word, which is the most common form found in: - 
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Bangla MWEs 
RM NRM 


x P CV I MN 


NSS SS RIW 
O Ip. 


VRW ARW NRW PRW_  IRW = AvRW 


[RM = Repetitive MWEs; NRM = Non-Repetitive MWEs; 
E = Exact; P = Partial; CV = Compound Verb; I = Idiom; 
MN = Multiple Nominal; SS = Sound-Symbolic; Ek. = Echo; 
CmV = Composite Verb; NC = Nominal Compound; HW = 
Hyphenated Word; Pr. = Predictable; UPr. = Unpredictable; 
NSS = Non-Sound Symbolic; RIW = Repetition with Inter- 
mediate Words; O = Onomatopoetic; Ip. = Ideophonic; 
VRW = Verbal RW; ARW = Adjectival RW; NRW = Nominal 
RW; PRW = Pronominal RW; IRW = Interjectional RW; 
AvRW = Adverbial RW.] 


Figure 3: Bangla MWE Classification 
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this language. Such repetitions can occur for almost all parts of speech, 
but are predominant in nouns, verbs and adjectives. 


Reduplication is also common in Onomatopoetic and 
Idecophonic words. Bangla is perhaps the richest Indian language with 
these types of sound symbolic word-forms. 


There are MWEs consisting of a word and its somewhat 
distorted repetition. The distortion occurs in vowel and consonant 
movement. A special type is called the echo-words, characterized by 
change in the first consonant of the word. These may be called partially 
reduplicated words. 


We describe below various repetitive word-forms, especially, 
reduplicated words under the sub-category exact and _ partial 
reduplication. It is observed that mono or bi-syllabic words are more 
often reduplicated. Words having more syllables are _ rarely 
reduplicated, likely due to difficulties in utterance. 


3.1.1. Exact Reduplication 


Reduplication is used both in inflection to convey a grammatical 
function, such as plurality, intensification, etc., and in lexical derivation 
to create new words. Sometimes, reduplication acts as inverse to 
intensification also e.g. AmAr sit sit korche (‘1 am feeling cold’). Bangla 
exact reduplicated words can be divided into three broad categories 
described below. 


3.1.1.1. Non-Sound-Symbolic Words 
a) Verbal Reduplication 


1) Generally, infinite verbs are reduplicated by the addition of 
derivational suffix -te or -e. They indicate continuation of action: 
balte balte (‘while speaking’), hese hese (‘laughing’). Causative verbs 
are also reduplicated as above or with the derivational suffix ive, 
e.g. khAoyAte khAoyAte (‘while feeding’), suniye suniye (‘so that 
others listen’). 


ii) Verbs are sometimes reduplicated just to indicate repetitiveness. 
Examples are: phire phire (‘repeatedly’). 


iii) Repetition indicates sense of interval: theke theke (‘with 
interruption’), theme theme (‘with interruption’). 
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iv) Distributive plurality is expressed by reduplication: beche beche 
(‘selectively’), bale bale (‘effortlessly’) etc. 


v) Sometimes verbs are reduplicated with present and future 
continuous tense marker, where they indicate continuous form of 
tense balba balba (‘about to speak’), dekhba dekhba (‘about to 
see’). Verbs are sometimes reduplicated in first personal 
pronominal form to signify the intention of doing some action.: jAi 
JAi (‘about to go’), dekhi dekhi (‘intend to see’). 


vi) Imperative mood (i.e. to request or to order someone) is also 
indicated by reduplication: jAo jAo (‘repeated request to go’), 
dekho dekho (‘repeated request to look’). 


b) Adjectival Reduplication 


i) Almost all adjectives can be reduplicated in Bangla. They generally 
convey a sense of plurality and intensity: JambA lambA (‘ bats long 
things’), bhArI bhArI (‘several heavy things’). 


u) Adjectives are sometimes reduplicated to indicate some feelings" or 
"likeness" or "nearness" of the property: bokA bokA (‘like idiot’), 
bAccA bAccA (‘childish’). 


iii) Reduplication may signify lightness or incompleteness: phAnkA 
phAnkA (‘nearly empty’). 


iv) Reduplication in countable and uncountable adjectives: sAri san 
(‘row after row), rAsi rAsi (‘heaps of). 


Those adjectives, which are abstract in nature or derived from 
abstract nouns, are usually not reduplicated. Also, more mono or 
disyllabic adjectives are reduplicated. This is perhaps true for all parts 
of speeches. . 


c) Nominal Reduplication 


i) Nouns are reduplicated to represent every-ness: bachar bachar 
(‘every year’), bARi bARi (‘every house’). 


ii) It is also done to generalize the function of the object: ghare ghare 
(‘in every house’), jane jane (‘every person’). 
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i) Reduplication represents continuation: kathAy kathAy (‘while 
talking’), AsAy AsAy (‘with hope’). 


iv) They can indicate something approximate: bhor bhor (‘nearly 
dawn’), sandhe sandhe (‘about evening’). 


v) Reduplication may represent engagement with each other: bhAi 
bhAi (‘a brother with another brother’), bandhu bandhu (‘among 
friends’). 


vi) Abundance may be marked by reduplication: phasale phasale (‘in 
heaps of corps’), meghe meghe (‘with heaps of cloud’). 


vil) Sometimes nouns are reduplicated to indicate the location of 
something in a casual sense e.g., DAle Dale (‘on the branches’), 
pAtAe pAtAe (‘among the leaves’) etc. 


vil) Reduplicated for calling someone: mA mA (‘mother, O mother’), 
ritA ritA (‘Rita, O Rita’). 


ix) Reduplication may indicate exclamatory expressions: rAm rAm 
(‘exclamation of disapproval’), SAdhu Sadhu (‘good, very good’). 


x) Numerals are reduplicated to emphasize the strength of number: 
das das bAr (‘as many as ten times’), pAnc pAncTA (‘as many as 
five’). 


x1) Nouns are sometimes reduplicated to indicate certain type of 
games: chor chor (‘certain type of game’), ghorA ghorA (‘certain 
type of game’), pulis pulis (‘certain type of game’), etc. 


d) Pronominal Reduplication 


i) Relative pronouns are reduplicated to indicate plurality and put 
emphasis: je je (‘those who’), keu keu (‘someone or other’). 


ii) Reduplicated interrogative particles: kabe kabe (‘on which days’), 
kothAy kothAy (‘in which places’). 


ili) Personal/reflexive pronouns are reduplicated to represent self-ness: 
tomrA tomrA (‘you yourselves’), nijerA nijerA (‘own selves’). 


iv) In a few cases, pronouns are reduplicated to indicate the 
possessiveness: jAr jAr (‘of those’), Tar Tar (‘of those’). 
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e) Interjectional Reduplication 


1) To intensify an action or feeling or emotion: mari mar 
(‘exclamation of amazement’), chi chi (‘rebuke, inflecting shame’). 


i) Sometimes interjections are reduplicated as enclitics: hAe! hAe! 
(‘what a disaster’) , SAbAs/ sAbAs! (‘well done’). 


i) Post-positions are also reduplicated in Bangla: mAjhe mAjhe 
(‘sometimes’), bhitare bhitate (‘within’), talAe talAe (‘underneath’) 
etc. 


Generally reduplication conjunctions (though there are forms 
like kintu kintu meaning ‘hesitant’) are not found in Bangla. 


f) Adverbial Reduplication 


Adverbs are also reduplicated to qualify the subsequent verb: 
Aste Aste (‘slowly’), jore jore (‘fast or loud’). Other adverbial forms are 
onomatopoetic word pairs. 


3.1.1.2. Sound Symbolic Words 


Bangla vocabulary consists of a large number of sound symbolic 
words, which are usually reduplicated indeclinable. They can be 
classified into the following types: 


a) Onomatopoetic words 


In Bangla, the sound of utterance of some words automatically 
express their meaning. They belong to the group of onomatopoetic 
words: AA hA, hi hi, dhitAng dhitAng, san san, khakar khakar, Tukus 
Tukus, ghaenar ghaenar, Dhang Dhang. They imply different sounds like 
laughing sound, musical sounds, sounds related to wind, coughing 
sounds, sounds of rain, sounds of water and others. The call of different 
animals also comes under this category: hAmbA hAmbaA (‘call of cow), 
gheu gheu (‘barking of dog’) etc. 


There are some words, which can be used both as single word, as 
well as double word: Dhang Dhang, jhap jhap. 


b) Ideophonic words 


_ There are expressions unrelated to sounds but still has the 
iconicity Jike a sound. They can express fullness, emptiness, deepness, 
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sharpness, colour etc. Although repeated, they often combine into a 
single word in the written form. Some exceptions are: dhu dhu, khAn 
khan, hu hu. 


Some onomatopoetic words are not exactly reduplicated; we 
shall discuss them in the section of partial reduplication. 


3.1.1.3. Repetition After Gap (RAG) 


If a set of words is repeated after a gap of some other words, we 
call it Repetition After Gap (RAG): such as Jete nA Jete. The gap may 
be two or more word long: hok bA nA hok (‘be it or not’). More than 
one neighbouring set of words can form RAG: Halei bA ki, nA halei bA 
ki (‘be it or not’, ‘there is nothing’). Multiword construction can also be 
found with the insertion of ke, bA, nA: e.g. din ke din (day-by-day). 


3.1.2. Partial Reduplication 


In partial reduplication, usually the second word is a modified 
form of the first. Partial reduplication can be Echo word, where usually 
the first consonant is changed. In other cases, vowels are changed in the 
second word. 


3.1.2.1. Sound Symbolic Word Partial Reduplication 


There are cases where a few sound symbolic words are partially 
reduplicated. In a sense, they are also unpredictable echo words, e.g. 
khacar macar, kicir micir, TApur Tupur, rinik jhinik. 


3.1.2.2. Echo Words 


In partial reduplication, sometimes the second component 
echoes the first component with consonantal change. This process is 
called echo word form. The echo words can again be sub-divided into 
two categories; as given below. 


a) Predictable echo words 


Generally, replacing the first consonant by ‘T’ or ‘ph’ in the 
second component makes an echo word. According to Chatterji(1939), 
this convention comes due to Dravidian influence. So, it is a predictable 
rule of Bangla language. This formation is used to indicate plurality and 
homogeneity. In a few cases, ‘s’ is also found in the echo x age 
Some examples are: : 
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1) Noun: bai Tai (‘books and others’), pen phen (‘pen and related 
objects’). 


ul) Adjectives: lambA TambA (‘tall’), rogA sogA (‘thin’). 
11) Verbs: jA0A TaoA (‘going’), thAkA TakA (‘staying’). 
1v) Pronouns: je se (‘who ever’). jeTA seta (‘whatever’) 


b) Unpredictable echo words 


There are echo words that cannot be predicted by simple rules. 
The echo component has been generated by convention, e.g. jemon 
temon (‘somehow or others’), jakhan takhan (‘any time’), Abol tAbol 
(‘arbitrary or random’), homRA comRA (‘big shot’). 


Sometimes the vowels are distorted in the second component. 
Due to vowel harmony, generally low and mid vowels are changed into 
high vowels: sAmle sumle (‘by managing’), kece kuce (‘after washing’). 


It has been found that in most cases mono or disyllabic words 
are echoed, but in some cases, especially in spoken languages 
multi-syllabic words are also sometimes echoed. Here in place of the 
whole word, part of the word is echoed, e.g. mAnabAdhikAr-TadhikAr 
(‘human rights and all’). This word is made of two words - mAnab and 
adhikAr, by the process called sandhi. Here, only the second word has 
been echoed by inserting ‘T’. 


3.2. Non-Repetitive Bangla MWEs 
3.2.1. Compound Verb 


Bengali is an analytical language. As a result, the synthetic 
tendency to form compound is absent but the expression of 
compounding is there. This is one of the reasons of having a 
considerable number of MWEs in Bangla. 


Bangla compound verbs can be divided into 3 categories 
(Bhattacharya, 1993): (4) Perfective Compound (ii) Infinitive 
Compound (iii) Passive Compound. They are described below. 


i) Perfective Compound 


They have at least two components. The first component is a 
perfective aspéct, i.e. they are ended in -e and the second one is a finite 
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verb. Linguistically this first component is called pole and the second 
one is known as vector. Here tense inflections are added to the second 
component: kheye phelA (‘completing the act of eating’), chunye dekhA 
(‘to feel by touching ). 


ii) Infinitive Compound 


In this kind of compound the main verbs having been the pole 
end in the infinitive suffix -te. Again the vector is a finite verb. Eg. 
ghumote deoA (‘allow to sleep’), jete deoA (‘allow to go’) etc. 


iii) Passive Compound 


In this kind of compound the main verbs have the passive marker 
-A -oyA at the end and the subsidiary verbs are always third personal. 
E.g. karA hae (‘be done’), dekhA jAe (‘can be seen’) etc. 


Compounds can also be made of three components: kare base 
thAkA (‘having done’), kheye paRe thAkA (‘having eaten’). Sometimes 
in such construction the first two components are semantically 
significant: dhuye muche phela (‘having washed and wiped’), khule phele 
deoyA (‘opening up’). 


There are more complex examples: paRe paRe mAr khAoyA 
(‘being beaten without offering any resistance’), dekhe dekhe cokh 
pance jAoyA (‘having tired of seeing’). 


This type of multi-word construction can also be found in 
imperative mood for non-honorific sense: kar ge jA (‘go and do’), dekh 
ge jA (‘go and see’). 


3.2.2. Composite Verbs 


Sometimes double word construction is found when a noun is 
followed by a verb. They are interrelated in such a way that their 
individual occurrence does not carry the proper sense: mukta karA (‘to 
free’), jalpAn karA (‘to drink’), kathA oThA (‘mention is made of’) etc. 


3.2.3. Composite Compound Verbs 


This type of construction can be considered as an extended form 
of composite verbs, where a single noun is followed by two pole-vector 
verbs. The noun-verb-verb combination carries a single sense. The 
verb-verb combination can again be compared with compound verbs. 
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E.g. sAnta hoye basA (‘to sit quietly’), gopan kare rAkhA (‘having kept 
something secret’) etc. 


3.2.4. Idioms 


Single-word as well as multi-word constructions are found 
among idioms. These types of constructions are often used in verbal 
discourse. 


Table 1: Different Categories of Idioms 


Idioms Related to Examples 


Human body cul cerA, cokher bAli, mAthA moTA, 
kAn pAtlA, nAk uncu, hAt sAphAi 


History, legends, myths gobar ganeS, nARu gopAl, rAm rAjja, 
and religion dAtA karna etc. 


Rituals and superstitions | kA/rAtn, gangA jAtrA 

Birds and animals kAk bhor, rAghab boyAl 
Foods and fruits barnacorA Am, mAkAI phal 
Non-living elements AkAs gangA, alik kalpanA 
Numerals pAnc kathA, sAt purus 


Slangs BaRa kuTum, dhol gobinda 


i) Noun followed by a verbal: hAl chARA (‘to give up all effort’). 


li) Two nouns side by side to form double-word construction: rAghab 
boyAl (‘big shot’), bhAT kApoR (‘maintenance’). 
iii) Double-word construction from sound symbolic words: thatamata 


khAoyA (‘to be dumb by surprise’), chal chal karA (‘nearly full of 
water’). 


iv) Two or three nouns followed by a verbal. Altogether they form a 
complex syntactical structure: cokher mAthA khAoyA (‘not able to 
see’), kAn bhAn karA (‘to speak ill of). 


v) Multiword construction with one or more nouns: jhAle jhole 
ambale (‘in all cases’), sargo marto pAtAl (‘heaven, earth and hell’). 


vi) Sometimes idioms have reduplicated words: hARe hARe cenA 
_ (‘knowing in detail in derogatory sense’), peTe peTe buddhi (‘very 
“ clever’). © 
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3.2.5. Nominal Compound 


Institutionalised phrases are semantically and syntactically 
compositional, but statistically idiosyncratic. Here all the components 
retain simple senses and combine to produce the compositional 
reading. This type of phrase usually indicates nominal compounds Le. 
combination of nouns. But sometimes the presence of adjective 1s also 
found. This type of compound is not similar to that of Sanskrit nominal 
compounds or samasa. 


1) Noun + Noun (+Noun) combination: bAgdn bARi (‘house 
garden’), maddhAnna bhojan (‘lunch’) etc. 


i) Proper Names: AndAmAn nicobar dlppunja (‘Andaman Nicobar 
Island’), thar marubhumi (‘Thar desert’),.brammahputra nad (‘the 
river Brahmaputra’), gangA nadi (‘the river Ganges’), bibekAnanda 
setu (‘Vivekananda Bridge’), hAORA brij (“Howrah Bridge’). 


iii) Amount of rupees: pAnc TakA pancAs paisA (Rs. 5.50). 
iv) Measure of time: sAtTA callis (7:40 hour). 


v) Sometimes three nouns (in most cases) or noun-verb-noun 
combination is used to form multiword expression in Bangla: kAnc 
kKAnTA hire (‘glass-cutting diamond’). 


A different kind of construction regarding the formation of 
nominal compound can be found in Bangla. 


@ Two nouns with nearly identical meaning may form nominal 
compound. E.g. Amod pramod (‘entertainment’), dukkha kasTo 
(‘woes’), bhAb bhAlobAsA (‘love and affection’). 


e@ Two nouns with somewhat opposite meaning also form nominal 
compound. E.g. AkAs pAtAl (‘heaven and hell’), s4dA kAlo (‘black 
and white’), naram garam (‘polite and rude’). 


3.2.6. Hyphenated Word 


In some cases, multiword converted into single word, are found 
with the interval of a hyphen. They can be categorized in certain groups 
given below: 


i) Negative marker nA added to the root word: nA-dekhA (cunseen’), 
nA-sonA (‘unheard’). 
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ul) In case of ‘every’ phi- is prefixed before noun: phi-bachar (‘every 
year’), phi-san (‘every year’). 


il) Copulative compounds are written with hyphen: kAche-piThe 
(‘neighbouring area’), 7A-meye (‘mother and daughter’). 


iv) Where two successive words unitedly act as specifier of noun, they 
may be _ hyphenated: AhAte-bonA (‘knitted by hand’), 
mastiska-prasuta (‘brain-produced’). 


v) In some cases, foreign words are separated by a hyphen: jouna- 
bisayak (‘sex-related’), diskoveri-chyanel (‘Discovery Channel’). 


vi) Sometimes compound nouns are written with hyphen: sapna-dekhA 
(‘to dream’), gAn-sonA (‘to listen music’). 


vii) Hyphen is found in echo word form: hAte-nAte (‘red-handed’), 
boepAr-scepAr (‘different kinds of matter’). 


vii) Numerals are written with hyphen: tin-tinTe (‘three’), cAr-cArbAr 
(‘four times’). 


ix) Sometimes je-, se- are prefixed before a word with a hyphen: 
je-rakam (‘as it is’), se-janna (‘for this’). 


x) Two or more related nouns are separated by hyphens 
(Bhattacharya, 2006). E.g. Tak-jhAl-nontA (‘sour-hot-salty’), 
bAbA-mA-bhAI-bon (‘father-mother-brother-sister’). 


It has been noted by S.K. Chatterji that hyphenation in Bangla is 
a negative practice. So, the fact is that MWE has become more complex 
and obviously different from other languages. 


4, Computer Processing of MWE in Bangla Text 


As mentioned before, we should have an MWE processor in 
problems related to Natural Language Processing. A _ typical 
diagrammatic scheme of such processing unit is shown in Figure 4. 
Note that the MWE analyser and sentence parser have formed a closed 
loop. This is because the MWE analyser may not detect all MWEs in 
the first pass. In case the next stage finds an anomaly and fails to parse 
the sentence for a particular group of words, then these words are sent 
back to check for a second time whether they constitute an MWE. If 
this attempt fails again, the sentence is rejected. 
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Problems in the analysis of MWE are compounded by the fact 
that an MWE-like construct may behave like a simple composition of 
words. For example, Bikas Aj ‘paTol tuleche’ most likely means ‘Bikas 
has passed away’, but Bikas Aj ekbastA ‘patol tuleche’ should imply that 
‘Bikas has harvested one bag-full of the vegetable paTol’. So, if the first 
stage analyser finds an MWE, it should inform the next stage processor 
if non-MWE interpretation of this group of words is also possible. Then 
the next stage will try to parse that set of words in both ways and decide 
which parse is acceptable. If such decision is not possible, it will pass 
both parses to the next stage for pragmatic and discourse-structure 
analysis. 


Successful 
Parsing 


Parsing Failure 


[TS = Text Sentences; MA = MWE Analyser; MSP = 
Morpho-Syntactic & Semantic Parser; AP = Applications 
Program (MT, QA & others)] 


Figure 4: The Multi-Word Detection Scheme 


- Let us now describe the basics of the proposed MWE analyser 
module. Since an MWE should contain at least two words, the 
maximum number of word pairs we have to check in a m-word sentence 
is m-1, but there can be at most Int(m/2) + 1 number of MWEs in the 
sentence. In reality, the number is much less; usually one or two per 
sentence, if there is any. Occasionally larger number of MWEs may be 
found in sentences with re-duplication, like in Tagore’s song gAche 
gAche, phule phule, pAtAy pAtAy, etc. 


The approach for detection of MWE can have two major 
components, one for lexicalized phrases and the other for 
institutionalised phrases. Each component can again have several 
sub-components e.g. for detection of idioms, exact and _ partial 
re-duplications (e.g. echo words), onomatopoetic word pairs, 
compound and composite verbs, nominal compounds etc. The method 
of processing them will be heavily lexicon-driven and rule based, 
because the reasons for certain word-pairs acting as MWEs may not be 
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defined by simple algorithmic arguments. Another approach is to use 
Statistics that is derived from a corpus of ten million or more words. 
Since it is highly resource and computation intensive, we concentrate on 
the former approach. 


The lexicon mentioned above can be maintained in the form of 
multi-word dictionary, with several information attached to each 
multiword: 


(a) whether suffix can be added and if yes, to which word, 


(b) whether the MWE is rigid, i.e., other words and phrases can be 
inserted between the words of MWE. If words can be inserted, 
what are these words, 


(c) whether non-MWE interpretation is also possible for the subset of 
words (as exemplified above). If yes, whether there is a way of 
detecting them by examining the neighbouring words, (for example, 
ekbastA in the above example). In addition, information about 
syntacto-semantic role of the MWE can also be maintained in the 
dictionary. 


For different categories of MWEs, it is wise to maintain different 
dictionary databases. Little work has been previously done on the topic. 
We have to carefully generate a reasonable set of such word forms. For 
example, in Bangla, about 2500 sound-symbolic multiword-forms have 
been collected by Chaudhuri (2007). Another 5000 idioms are being 
compiled in electronic format. 


But all examples of all categories may not exist in the dictionary 
databases, because of huge possibility of productive forms continuously 
generated in any vibrant language. Echo words belong to one such 
category, which can better be detected in a sentence using rules (like 
the first consonant is replaced by ‘T’ or ‘ph’ in most echo word forms). 
Only the exception to such rules may be stored in an echo-word 
database. 


Rules can also be used profitably in detecting MWEs of certain 
other types. One example is the various reduplicative structures, which 
is very common in Bangla text. 


In-ahy sentence, an n-th order Multiword Construction (MWCn) 
can be expressed as: 
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MWC = W, WW, W, (1) 


where syntactic and semantic roles of the word W, is associated 
with those of w, for all i # j. Inan MWC some words may be repeated. 
If w. = w, for j = 1+1 then reduplication is said to have occurred. We 
can call it Simple Repetition (SR). E.g., kA kA. In case of multiple 
repetition, we have the structure like: 


As mentioned before, word repetition may be found after the 
gap of a few words. The general form of Repetition After Gap (RAG) 
is: 

RAG = w, w, ... w.[W, W,. Wi] w .. W, (2). 


Wo : i 


— 


where repetition has occurred after the gap of W, W, ... W.. An 
example of single word gap is jete nA jete. The gap may be two or more 
words long. E.g. hok bA nA hok. A set of two or three words can also be 
repeated after a gap. E.g. Halei bA ki, nA halei bA ki. 


All RAGs are not treatable as chunk of MWE. E.g. Ramer bAn 
Syamer bAn besh kAchAkAchi. Here the repeated word bAri is related 
to two different subjects and are not MWE. So, only some specific word 
pattern in the gap region makes an MWE. However, simple repetition 
of order two can always make a chunk of MWE. For meaningful 
translation into other language, this chunk should be detected and 
treated as a unit. 


As mentioned, the two-word constructs, 1.e. MW C, = W, W, are 
abundant in Bangla. The so-called nominal compounds also come 


under this category. However, SR is most frequent in Bangla sentences. 


Longer MWCs are idioms and metaphoric constructs. However, 
the value of ‘n’ in Equation (1) rarely exceeds 6. In our algorithm, we do 
not consider more than six consecutive words for automatic detection 
of MWCs. Idiom-like words are semantically important, so our 
approach for computer processing is list-oriented rather than 
rule-oriented. 


A special class of MWC is onomatopoetic or sound symbolic 
words, which are located through database. On the other hand, we have 
used rule-based approaches for echo word-forms. 
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A typical algorithm to detect MWC from Bangla sentence is as 
follows: 


i) Delete the emphatic marker from the words. If any word is 
repeated in the sentence, then flag positive and number the distinct 
repetitive words as w), W5, etc. from left to right. If w, and w, have 
no word in between, then make it a word chunklet. Allow each 
chunklet to grow as much as possible. Call it simple repetition SR}. 
Find all other simple repetitions in the sentence. 


ul) Go to the first chunklet from left. Let this be CW,. 


ii) Continue till the same CW, appears next time in the sentence. If 
the words between them belong to a group in the database, call it a 
repetition after gap RAG,. Find all RAGs. 


iv) For each word w, in the sentence, check if w; , , satisfies the echo 
word-form or not. If yes, detect it as echo-word. Find all 
Echo-words using both rule-base and database. 


v) Find if w,,, is a vowel-changed version of w,, using a rule-base of 
vowel change. 


vi) For any w, that is a verb word, check if w; , , is a non-repetitive verb 
word also. Then check if w, , , belongs to a small set of verbs which 
act as vector to the polar verb w.. If yes, declare the pair w; w,,, as 
a compound verb V... Find all V. in the sentence. 


vil) For any w, that 1s a noun having verbal property, check if w, , ; is a 
light verb. If yes, call the composite verb subroutine to confirm the 
composite verb. 


viii) Use the idiom database to check if any part of the sentence belongs 
to this category. 


5. Results 


Two sets of experiment have been done for the analysis of 
Bangla MWEs. The first experiment tries to identify the relative 
presence of different MWEs from a medium size corpus of Bangla text, 
collected from different printed media. 


The second experiment is about automatic identification of 
MWEs from a few sets of smaller corpus. Moreover, we have attempted 
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to provide some linguistic properties and explanations about MWEs of 
various types. 


5.1. Relative abundance of Bangla MWEs 


We have used the TDIL corpus of about 3 million words to 
manually detect the MWEs. At first, we have detected repetitive words 
from the corpus. Since reduplication has the lion’s share of such 
constructs, fifteen most frequent reduplications found in the corpus are 
Shown in table 2. Actually, we obtained about 12,800 distinct 
reduplicated word-forms in the corpus. 


In the corpus we have found 27.2% noun reduplication, 5.3% 
adjective reduplication, 30.5% verb reduplication, 5.3% adverb 
reduplication, 3.6% pronoun reduplication while interjection is fousd 
only in 0.6% cases. The onomatopoetic reduplication is 17.4% of the 
total number of reduplications. | 


Table 2: Reduplicated Word Frequency List 


Sange sange kichu kichu Aste Aste 
kon kon mane mane par par 
keu keu bAr bAr 


dhIre dhIre choTo choTo hAjAr hAjAr 
baRo baRo karte karte dekhte dekhte 


[RW = Reduplicated Word; F = Frequency] | 


In table 3, the topmost ten repetition after gap RAG words are 
shown. We observe that for all examples in the table, the intermediate 
word is nA. Actually, in Bangla language the use of nA is among the 
most frequent ones. Some RAG words, other than intermediate nA are 
kaTA theke kaTA, ekTir par ekTir etc. We got one hundred distinct 
RAGs from our corpus. For sentence parsing one should treat them as 
single chunks. 


We can maintain a dictionary of the most common repeated 
words after gap (RAG) and use it for various applications software 
design. The work has started and its details will be reported in a future 
correspondence. 


One of the most common types of MWEs is the hyphenated 
word. In table 4, a list of most frequent hyphenated words found from 
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Table 3: RAG MWEs Frequency List 


RAG MW 


paRte nA paRte 
ekTA nA ekTA 
ekdin nA ekdin 
Hok bA nA hok 
Aste nA Aste 


kon nA kon 
hate nA hate 
Jete nA jete 

kichu nA kichu 
kono nA kono 


the corpus is provided. We obtained around 18,000 distinct hyphenated 
MWEs from this database. 


Table 4: Hyphenated Word Frequency List 


Je-kono 5 uttar-pascim ) se-sab 

Je-sab Sujog-subidhA AdAn-pradAn 
bhU-pns Ther Je-sakal daksin-purba 
ba-dlp pariksA-niriksA | 52 | daksin-pascim 


[HW = Hyphenated Word; F = Frequency] 


Generally, people do not use hyphen so systematically, 1.c. for 
the same pair of words sometimes they use hyphen and sometimes not. 
Lack of clear-cut rules may be one of the reasons of getting so many 
hyphenated words in the corpus. 


In table 5, the most frequent echo words available in the corpus 
have been shown. In Bangla, the echo words are more frequent in 


Table 5: Echo Word Frequency List 


Jakhan takhan Age bhAge 


Ase pAse eTA seta 
khAoyA dAoyA kApaR copaR 
eTA oIA posAk AsAk 


AnAce kAnAce asukh bisukh 
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spoken form. So, except for the dialogue-based writing the number of 
such words is not large in written documents. 


Since compound verbs play a major role in forming the structure 
of Bangla MWEs, a proper identification of them as a chunk is useful. 
The following most frequent compound verbs have been identified from 
the corpus. 


Table 6: Compound Verb Frequency List 


hoye geelo niye JAoyA 
hoye jAbe poRe geelo kore base 


cole geelo goRe uTheche choRiye pore 


kore dey poRe JAy niye JAy 
kore deoyA cole gelen hoye giyeche 


[CV = Compound Verb; F = Frequency] 


Noun-Verb collocation is another important feature of Bangla 
MWE. Linguistically this type of construction is called composite verb 
or conjunct verb. In table 7 some most frequent composite verbs found 
in the corpus are shown. 


Table 7: Composite Verb Frequency List 


beebohAr karA | 487 | lakhya karA : kathA balA 
Suru kare bandha kare sAhA}jya kare 
srisTi kare grahon kare LAbh kare 


kA) kare ullekh karA beebohAr kare 
nirbhar kare AlocanA karA | 2 grahon karA 


[CmV = Compound Verb; F = Frequency] 


We have also found small number of other MWEs from the 
corpus and collected them in a database. 


5.2. Automatic identification of the MWEs 


As mentioned earlier, in this section we have presented the 
results of machine identification of possible Bangla MWEs. We have 
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randomly chosen six Bangla text files containing approximately 20,000 
words for our present experiment. Also, recall, precision and f-score 
have been used as our evaluation measures to test the performance of 
our system. 


Reduplication 


Identification of Bangla reduplicated word is more or less 
accurate since it is a straightforward task. In most cases, it is possible to 
get nearly 100% accuracy in identifying the reduplicated words. But 
some difficulties have been encountered during the identification due 
to following reasons. 


e@ Absence of punctuation marks between two identical words of two 
consecutive sentences (due to inaccurate proof-reading or other 
reasons) may lead to false identification of reduplicated words. 


e@ Typing mistakes, if not corrected during proof-reading, may also 
create identification problem. 


We had to face similar problems for RAG word identification as 
well. 


Hyphenated Word 


Typing mistakes can create problem during the identification of 
hyphenated words. For example, bikele-apis (‘office in the afternoon’) 
has been observed in the corpus, which should not be hyphenated. We 
came across several such situations in the corpus. Suffixes like -ke, -er, 
-i, -o after hyphen were not considered as part of hyphenated words 
during identification. This resulted in about one percent error. Better 
results can be expected if we take care of these cases. 


Echo Word 


For the identification of echo words, an exhaustive list is needed 
because they are predictable only when the second component starts 
with ‘T’ and ‘ph’. So, the accuracy of our results depends on the 
completeness of the list. The more complete list will provide us more 
improved result. 


Compound Verb ._— 


For the identification of compound verbs, we made a vector list 
(the verbs which are able to form compound verb). On the other hand, 
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being infinite verb, the pole generally ends in -e or -te. This feature is 
helpful for identification of the compound verbs. A Bangla verb-suffix 
list has also been maintained for identification of verb. 


During identification of compound verb, word pairs like kine 
diyechilAm (‘brought him/her’), neme jAcche (‘going down’), egiye 
Asche (‘coming close’), have been identified as compound verbs 
because of their pole and vector combination. But linguistically they are 
not compound verb, since meanings of both verbs in the word pair are 
equally dominant. 


The combination of more than two consecutive verbs is not 
considered here. 


Table 8: Identification Performance of Compound Verbs 


File 1 
File 2 
File 3 
File 4 
File 5 
File 6 


Average 


Composite Verb 


For the identification of composite verbs, two aspects are very 
important. The first aspect is, we have to choose the noun (verbal 
noun), which can form composite verbs. The second aspect is the 
selection of proper verbs (light verb) that forms such composite verb. 
We have marked the action like nouns that may form the composite 
verb in the computer dictionary. On the other hand, the light verbs are 
marked in the verb-list. 


In Bangla, demarcation between Noun and Adjective is not very 
strict. Sometimes an adjective behaves like a noun, especially before the 
light verbs. E.g. jwaljwal karA (‘sparkling’). But during identification of 
nouns, we have not considered such adjective words. Better result is 
expected after considering the above situation. 


Here we have considered the combination of a noun and single 
verb only. 
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Table 9: Identification Performance of Composite Verbs 


[Recall Presson, FScore 


File 1 
File 2 
File 3 
File 4 
File 5 
File 6 


Average 


Composite Compound Verbs 


The nouns, which are eligible for forming composite verbs, are 
also considered as the nominal part for this category. The second most 
important aspect is that the verb-verb combination has to be identified 
as a compound verb. The qualities of the verbs must be similar to those 
of pole and vector of a compound verb. 


The possible combinations of nominal compound verbs have not 
been studied exhaustively, so we are yet to get consistent and reliable 
results for identification of such type. At present, we are examining the 
possible grammatical combinations of such constructions. This type is 
not abundant in Bangla corpus, so we have not presented any statistical 
figure. 


Nominal Compound 


The identification of nominal compounds requires some 
measure of list-based approach. We have not worked on this in great 
details. 


We have started our work in the following direction. We have 
maintained a dictionary of words that may form nominal compounds. 
We have also taken some rule-based approaches. It has been found that 
there are certain trends of forming nominal compounds in Bangla. 
Some of them are: 


© Ifanoun immediately precedes words like skul (‘school’), bidyalaya 
(‘school’), kalej (‘college’), bisyabidyalaya (‘university’) then the 
whole unit of words is categorized as nominal compound. 
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e Ifthe preceding words of nad (‘river’), nadi (‘river’), setu (‘bridge’), 
brij (‘bridge’), sarani (‘road’), etc. are nouns, they also can be part of 
nominal compounds. 


@ If mi: (‘Mr.’), Shree (“Mr.’), Srimati (‘Mrs.’), kuman (‘Ms.’) precede 
nouns, then they together can form nominal compounds. 


Idioms 


For the study of Bangla idioms we have to prepare an exhaustive 
list, which requires plaintive man-power and huge amount of time. In 
the absence of these we are not yet able to prepare a sufficient list of 
idioms. Only then we can study the nature of Bangla idioms. We are 
working in this direction. 


6. Conclusion 


An initial study of Bangla multiword expression from the 
computational point of view has been done for the first time. We tried 
to enumerate the groups of word that must be considered together for 
both syntactic and semantic interpretation. However, it is agreed that 
the list given here is not exhaustive. For example, we did not work on 
metaphors like bAnglAr bAgh (‘the tiger of Bengal’) or bidyer jAhAj (‘a 
ship of knowledge’). Future researchers may find newer groups under 
this category. 


Some quantitative evaluation has been made from a medium 
corpus of 3 million words about the real use of such multi-word 
constructs. It is noted that re-duplication of various forms, nominal 
compounds and the compound verbs are most frequently used 
categories. Onomatopoetic words are also noted in the text. Pro :rbs 
are not so commonly used, though about five thousands words of them 
have been compiled in Samsad Bagdhara Abidhan (Bhattacharya, 
2005). To get a more reliable statistics, a corpus of ten million words is 
needed. 


Some preliminary computational works have been done to 
identify various MWEs from arbitrary sentences. We have mainly used 
dictionary and rule-based approaches. However, to improve the 
reliability and accuracy, more sophisticated statistical techniques like 
Markov random field model may be employed. Work has started in this 
area and useful results will be presented in a future publication. 
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DIALECT SURVEY OF MALAYALAM IN RETROSPECT 
B. GOPINATHAN NAIR 


This paper contemplates on the ‘Dialect Survey of Malayalam. 
Ezhava/ Tiiya’, the first systematic, rigorous and comprehensive study 
executed four decades ago in the Department of Linguistics, University 
of Kerala and provides a brief review. The study pertains to the speech 
variations of a major community- Ezhava/Tiiyas spread throughout 
Kerala exemplifying the diagnostic linguistic variations across social 
and regional parameters, demarcating the chief dialect areas and sub 
dialect areas on the basis of bundles of isoglosses drawn from different 
levels of linguistic structure showing their nature and spread. It was 
completed in 1969 and the report published in 1974. The atlas 
comprising 293 dialect maps which is complementary to the report, 
however could be published only recently in June 2006 by the 
International School of Dravidian Linguistics, which prompts this 
reflection as I was one of the investigators of the project. A quick 
glance at the study and Survey of Malayalam dialects would be relevant 
in this International Conference that envisages a comprehensive study 
and survey of Tamil dialects and an evaluation of the state of dialect 
studies in South India in the backdrop of the advances made in 
traditional dialectology and modern sociolinguistics. 


The Malayalam dialect Survey Project directed by VAI. 
Subramoniam as the Principal Investigator along with a team of six 
researchers in the mid 1960’s continued for a decade, successively 
surveyed the dialects of the major communities viz. Ezhava/Tiiya, Nair 
and Harijan spread throughout Kerala, collected, analyzed and 
preserved considerable data, which is a treasure house for further study 
of speech variations in several other dimensions. Among the team VI. 


* Paper presented at the International Conference on Dialects in Tamil 
sponsored by the French Institute of Pondicherry (FIP) and C.I.1.L., Mysore at 
Jawaharlal Nehru Conference Hall, FIP from 23rd to 25th August 2006. 
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Subramoniam, G.K. Panikkar, C.J. Roy and myself are the surviving 
members who witnessed the publication of the dialect maps of 
Malayalam after 37 years of its completion. Looking at the work in 
retrospect my candid reminiscence goes back to four decades when a 
methodical attempt for the serious study and survey of Malayalam 
dialects had begun in Kerala. 


This paper is dedicated to the fond memory of my colleagues, 
late T. Velavan, P. Somasekharan Nair, R.V.K. Thampuran and A.P. 
Andrewskutty who contributed their share well in this academic pursuit 
with sincerity, honesty, dedication and left the world of dialect diversity 
for eternal peace. 


Dialect variations and dialect diversity in any sizable speech 
community spread throughout a larger territorial region is a 
sociolinguistic and socio-cultural reality which can be studied in several 
perspectives by different ways and means. There is no single approach 
that would satisfactorily explain the multifaceted dimensions of dialect 
diversity. A close examination of the social and structural correlates of 
linguistic diversity would reveal that language and society are not 
discrete categories but well structured systems and they are 
heterogeneous. It is also true in the case of dialects irrespective of caste, 
religion or region. Besides, there may be overlapping of caste, 
communal and regional features. 


Following D.P. Pattnayak’s article on ‘Caste and Language’ 
(IJDL.IV.1.1975: 97-104) where he states that caste dialect is a myth 
and it is an a priori assumption, several of his contemporaries and 
commentators (IJDL.IV.2.1975: 361-78, IJDL.V.1. 1976: 82-85) had 
vehemently argued for and against the existence of caste dialects in 
Indian social contexts exemplifying with materials drawn from several 
sources and levels of speech and projected the then existent diverse 
opinion. Later in his concluding remarks Pattnayak himself has 
categorically affirmed that caste is one of the predominant variable and 
not the sole variable and his motive was to create an awareness among 
caste dialect investigators at that time and to urge them to attempt a 
dynamic approach to the study of linguistic variations as against the 
traditional static approach which was a scholarly and well intended 
remark. 


Leaving aside, some of those assertions emerged from a few 
scholars which could be viewed as stereotyped, biased etc. what 
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remains is the unbiased notion of dialect variations with regard to caste, 
religion, region, age, sex, education profession, economic status and 
such other factors leading to social stratification including formal vs. 
informal, stylistic variations etc. independent or interdependent, if any, 
of the social parameters of linguistic diversity. It is in this context one 
has to evaluate the form, content and function of the social and 
structural correlates of linguistic diversity in South Indian society both 
rural and urban settings, profitably based on some specific hypothesis, 
theoretical framework and methodological techniques. 


The Emergence of Malayalam Dialect Surveys 
Assumptions and Goals 


The Dialect Survey Project envisaged the following assumptions 
and goals: 


@ The make-up of each community based on caste or religion can be 
gauged. The conversion to Islam is from the lower strata of society. 
Similarly, it will be possible to make inferences regarding the other 
religious communities if surveys are completed. 


e@ The movement of population whenever historical evidences are not 
available can be traced. 


e@ Textual criticism will stand benefited by identification of dialectal 
forms. For instance, the accusative case form in North Malabar is - a 
whereas it is -e in Southern Kerala. The edition of KrishnaGatha of 
Malabar has been corrected to have -e wherever it had -a due to 
ignorance. So also the correction of 7k as 77 and fic as fin in the 
modern editions of old Malayalam words in Kannagéa 
Raamaayanam is against the evidence of dialects. 


@ The standard dialect formation can be gauged by facts from dialects 
and the elite formation. The standard dialect is generally 
sanskritised and therefore too unnatural. When a standard 
colloquial is formulated teaching in the language in a generally 
acceptable dialect is possible. 


@ When history has no records to prove the existence of principalities, 
dialect survey work and place name surveys can sometimes 
authenticate it. 
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@ The sociological theory that Sanskritization is less with the lower 
caste Harijans can be tested by dialect work. 


@ The history of a language can be complete only when the dialect, 
which is the functional language, is analyzed and described. 


@ The definitions of dialect, language and idiolect are still unclear 
which can be made precise only when the dialects are analyzed. 


e A language tribal or non-tribal is the cultural heritage of our country 
and it has to be preserved. 


e@ Exploring the diverse social customs with regard to caste, 
community and religion may help to draw relevant conclusions on 
regional history including the political changes and social reforms. 


e@ Study of dialect formation, which is a pan Indian feature, would help 
to make out the causes for sociolinguistic and regional changes, 
formation of standard dialect, which can be used for formal 
communication, finding out archaic forms to infer the prehistory of 
language. 


e@ To prepare a thorough historical grammar for Malayalam on 
linguistic principles. 


e@ To enrich the Malayalam lexicon by supplying dialect forms which 
have so far not found a place in any of the Malayalam dictionaries. 


Pilot Survey Project: Pulaya Dialect 


This is the first project undertaken in 1958 to sample the dialects 
of Malayalam as spoken by the Pulayas of the Harijan community under 
the auspices of the Tamil Department, fully sponsored by the 
University of Kerala. This survey had helped in collecting dialect 
material from a questionnaire comprising 430 items along with stories 
and dialogues from two informants each from the erstwhile 9 districts of 
Kerala!. The Principal Investigator, V.I. Subramoniam, did the whole 
fieldwork. After tape recording the material he had analyzed the 


1. They are: Trivandrum (Tr), Quilon (Q), Alleppy (A), Kottayam (K), Ernakulam (E), 
Trichur (T), Palghat (P), Calicut (C), Cannanore (Cr). 


Now there are 14 districts; the additions are: Pathanamthitta, Idukki, 
Malappuram, Wayanad and Kasargod. 


The former districts are now spelled as Thiruvananthapuram, Kollam, 
Alappuzha, Kottayam, Ernakulam, Thrissur, Palakkad, Kozhikkode and Kannur. 
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phonology and published a paper Phonemic Outline of a Dialect of 
Malayalam (1L.23.1962). The tape-recorded material has been used by 
the 1967 M.A. students of Linguistics for preparing descriptive 
accounts of the phonology and morphology for their dissertations. 
Owing to the relatively small word list, poor quality of recordings in 
some cases and less experience of the students in preparing descriptive. 
grammars, the outcome of the Pulaya dialect survey had its limitations. 
However, a broad outline of the variations and marking the Malayalam 
dialects into five regional varieties on the basis of this caste were made 
viz. (1) Northern dialect comprising Cannanore and Calicut (ii) Cochin 
dialect including Ernakulam, Trichur and part of Kottayam districts 
(iu) Travancore dialect comprising the rest of Kottayam, Alleppey and 
Quilon. (iv) Palghat dialect (v) Trivandrum dialect. Besides, the 
Laccadive speech reckoned as separate one, altogether six regiohal 
dialects were identified for Malayalam at that time. 


Small-scale Dialect Survey Project: Ezhava/Tiiya 


Inspired by the outcome of the marked regional differences in 
the dialect of the Pulaya community, a small scale Dialect Survey 
Project was implemented in the Department of Linguistics to study and 
survey the dialects of Ezhava/Tiityas. 


The Ezhava/Tuyas spread throughout Kerala belong to a major 
community among the Hindus and they are variously called as Ezhavas 
in South Travancore area, Tiiyas in Malabar, Hosdurg, Kasargod and as 
Billavas in South Canara. They are also known as Chovas 1n other areas. 


The Tiiyas claim superiority over Ezhavas both in the social and 
cultural affairs and consider themselves as a superior caste. However, 
the Ezhavas and Tiiyas are generally treated as one and the same caste 
as there was practically no mixing among the groups and the different 
names depict more of a regional identity rather than any caste-specific 
feature. 


The dialect survey project started from mid July 1965 under the 
direction of the Principal Investigator V.I. Subramoniam. Three ~ 
Research Assistants were selected, intensive training given to them in 
phonetics and linguistic analysis to carry out fieldwork. As dialect study 
is related to other disciplines such as history, geography, sociology etc. 
before directly entering to the proposed dialect survey, the team made 
certain preliminary studies and discussions regarding foreign influence 
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in Kerala history, the tribes of Kerala, the history, methods and 
techniques of dialectology, the languages of Kerala etc. 


In the first field trip they selected three points in rural areas 
which showed conspicuous dialect differences, stayed there for three 
months, collected material with the help of a preliminary word list of 
2600 items, transcribed the data in phonetic script and also recorded it 
in tapes. After returning from the field, the word list was critically 
reviewed and revised. The second field trip was conducted for three 
months and three more points covered. Grammars and dictionaries 
were prepared for all the six localities representing the erstwhile 
Malabar, Cochin and Travancore regions, which coincided with the 
political division that was in existence in Kerala until 1956. 


Word List 


For finalizing the dialect grammars and to locate broadly the 
differences between the dialects, the preliminary word list was 
successively revised which contain 3001 items useful for linguistic, 
cultural and sociological investigations. It has two parts. The first part 
consists of 2144 items including common words and about 650 verbs. 
The second part contains mainly phrases and sentence patterns to elicit 
grammatical categories viz. singular plural concord, gender differences, 
case, qualifiers and modifiers, verbal participles, verbal nouns, tenses 
and simple sentences of different types. 


A continuous text containing popular mythical stories, folk tales, 
festivals, marriage functions, conversation between two informants etc. 
has also been included and the speeches of the informants were 
tape-recorded. The minimum expected yield was an outline grammar 
and a dictionary comprising of 3000 words. The productivity of the 
word list has been tested and refined after the first fieldwork. 


Dialect Dictionary and Grammars 


The items in the word list were utilized in the dialect dictionary. 
The variants collected from 6 regions for each item were recorded from 
north to south, mostly in phonemic script except those items meant for 
eliciting phonetic variants. The dictionary thus helps to find out the 
number of variants for an item and their places of occurrence and it is a 
useful mode of presentation of dialect forms. 


Detailed descriptive grammars for the six regional dialects were 
also prepared. Contrasting the Dictionary and Grammar at the six 
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points showed considerable differences. The frequency of each change 
was counted so that the final questionnaire could contain all the 
frequent changes. Lexical substitution was found to be most frequent 
followed by lexical and phonemic free variations, elision of units etc. 


The Final Questionnaire 


The front page of the final questionnaire is meant for eliciting 
the biographical and sociological information of the informants. The 
names of the District, panchayat, village and the name, age and 
occupation of the informant, length of residence in that area, birth 
place, if married, the birth place of his wife, his travel experience, 
information regarding the other communities of the locality and his - 
educational background (cf. Appendix 1). All informants selected were 
men, preferably of the age group 55-60. One informant from each of fhe 
then existent 921 panchayats and their responses recorded. 


Leisurely administration of* the questionnaire containing 
altogether 255 items, at a point took nearly 2 hours and 30 minutes. 
Speedy administration took one hour and 45 minutes. 


The investigators were instructed to note on the margin of the 
questionnaire, the reactions of the informants to the questions, the 
various responses among women, children or other castes and 
communities and their identification of words as Tamil, Tulu, Kannada 
or rare, modern, used by elders or younger and the differences they 
have observed between their dialect and that of the others. Taboo 
words also to be incorporated in the marginal notes. Cultural 
information, the informant’s attitude to his own dialect and all those 
have proved to be useful in assessing the sociological and cultural 
factors involved in the dialect differences. 


The questionnaire has two parts of which the first contains 250 
naming questions. They begin with question type ‘what is the name you 
use for...’ The probable and attested variants in the dialects analyzed 
and the dictionaries prepared are given against each question. Dotted 
lines are provided under the set of words to record fresh response. The 
second part contains homonyms with their meanings. Only five have 
been included in this questionnaire. The dotted lines are also given 
underneath to record further meanings, if any (cf. Appendix 2). 


The order of importance given in the questionnaire is as follows: 
phonemic, lexical, morphological, phonetic and semantic variants. See 
the following few examples: 
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Phonemic Replacements 
v/b _vellam/bellam ‘water’ en veeli/beeli ‘fence’ (36) 


s/§ | masi/masi ‘ink’ (1), c/$ ceelu/Seelu ‘beautiful (4), 
c/® céttan/éttan ‘elder brother’ (143) 


Vy — kooli/kooyi ‘hen’ (2), kelakku/keyakku ‘east’ (18), 
pulu/puyu ‘worm’ (34) 


u/o = mula/mola ‘bamboo’ (39), muttu/mottu ‘pear!’ (40) 
Ve — ilaya/elya ‘younger’ (13), iratt/eratti ‘double’ (16) 


a/e narampu/ferampu ‘vein’ (55), cannati/cennati ‘friend’ (60) 
Lexical Replacements 


The lexical group of fifty five words include names of birds and 
animals, plants, trees and their yields, words for body parts, household 
articles, ceremonies, castes and kinship terms. The elicited material 
shows considerable variants and many of them are not found recorded 
in the published Malayalam dictionaries and lexicon. 


Morphemic Replacements 


This group includes suffix replacements viz. -inRa / -inRe / -ute 
‘genitive case’ (193) -nnu / nu ‘present tense’ -kal/ -al ‘plural’ (191) -atti / 
-i ‘feminine’ etc. (190) 


Variant Personal Pronouns: naal / impal /nannal, ‘we’ (Excl.) (185), 
niyyu / iyyu / nu ‘you’ (Sg.) (186), oon / avan, ‘he’ (187) oolu / avalu, 
‘she’ (188) ooru / oolu / avaru ‘they’ (189). 


Phonetic replacements include vocoid variations: maram / mar m 
‘tree’ (171)i/w/a : kayyi / kayyw / kayyd ‘hand’ (172 (b)). 


Contoid Variants: kk / kky atakka / atakkya ‘areca nut’ (II.1) 


Homonyms 
paakku ‘areca nut, ceiling, bag’ (II.1) 
oram ‘back shoulder, manure’ (II.2) — 
kollan ‘blacksmith, shoemaker’ (II.4) 


toolu ‘armpit, shoulder’ (IL.5) 


a 


2. The numbers within brackets indicate the serial number of items in the 
questionnaire. 
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tulluka = ‘to throw, to jump, to frisk’ (I1.3) 


Preference was given to indigenous vocabulary than to 
loanwords because loan words were later in points of time than native 
items. 


If a phonemic or allophonic variation is recurring in several 
words like b-/v-, which has a larger frequency, it is taken as an item 
suitable for the questionnaire. 


Items showing feasible sound changes; elisions of any unit, if 
consistent and recurring in several words were preferred. 


The differences in phonology between the dialects were 
numcrous. Even in the replacement by zero we had noticed that the 
phenomenon was found in almost all words, in some, zeroes replaced 
morphemic units. The chief aim of the questionnaire is to identify 
differences in dialects and to demarcate the dialect boundaries based 
on linguistic features that are diagnostic of each region. 


Preparation of Dialect Maps 


The dialect atlas is a very effective mode of presentation of 
dialect variants. Besides listing the variants of an item it shows the 
spread of each form in geographical area whether contiguous or 
non-contiguous, which on the other hand may not be obtained from a 
dialect dictionary. Both are complementary and indispensable for the 
presentation of dialect material. 


From the copies of the filled up questionnaire brought from the 
field, the items with their variants were entered on separate paper and 
under each variation, the name of the District and the number of the 
Panchayat which provides all information regarding the frequency and 
the variety of forms were noted. Xeroxed tables for the purpose of 
filling the details were prepared with symbols in the vertical left hand 
column and the names of the Districts in the horizontal column on the 
top. When the boxes were filled, the table showed the frequency of the 
forms noted against the symbols in each District (cf. Appendix 3). 
These tables along with the map give a complete picture of the form, its 
variants, frequency and contiguity of the Panchayats where it occurs. 


The boundaries of all the 921 panchayats were marked on a map 
of Kerala of the scale 1"=9 miles which was used as the base map (cf. 
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Appendix 4). The symbols assigned for each item were marked with 
black ink in each panchayat according to the occurrence of the item. 
Later stamps with symbols were prepared and made use of which 
helped to save time besides marking the symbols uniform in size. 


260 main maps and 33 subsidiary maps were drawn. Altogether 
the number of maps drawn to indicate the spread of items is 293. 


On the basis of the spread maps, a dozen isogloss maps 
indicating the dialect boundaries have been drawn (cf. Appendix 5). 
Maps help in interpreting dialect features for the history of a language, 
besides pinpointing the dialect areas. The directions of change of 
features can also be broadly guessed from isogloss maps. 


Marking of Dialect Areas 


_On the basis of the spread maps, 12 major dialect areas have 
been identified for Malayalam, besides the isogloss items that separate 
one dialect region from the other were also specified on the basis of the 
Ezhava/Tiiya dialect survey. They are (cf. Appendix 6): 


1. South Travancore dialect comprising all panchayats in Trivandrum 
District and the panchayats south of Quilon District. 


2. Central Travancore Dialect - northern portion of Quilon District 
and certain portions of Alleppy and Kottayam District. 


3. West Vempanad Dialect - northern portion of Alleppy District, 
west of Vempanad Lake. 


4. North Travancore dialect - Almost all panchayats of Kottayam 
District. 


5. Cochin dialect - Ernakulam District and certain portions of Trichur 
District. © 


6. South Malabar dialect - most of the panchayats of Trichur District 
and few panchayats in Palghat District. 


7. South Eastern Palghat - South eastern region of Palghat District. 
8. North Western Palghat - certain regions of Palghat District. 


9. Central Malabar Dialect - Southern parts of Calicut District and 
certain panchayats of Palghat District. 
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10. Wayanad dialect - Eastern region of Calicut District. 


11. North Malabar dialect - southern portion of Cannanore District 
and major portion of Calicut District. 


12. Peak Dialect - northern panchayats of Cannanore District. (This 
name was given because of the northern most region of Kerala). 


Besides, these major dialect areas 32 sub dialect areas were also 
marked along with the isogloss items. (For details c.f. Subramoniam 
1974: 655-688). 


The report of the Dialect survey of Malayalam: Ezhava/Tiya 
comprising 689 pages with 15 maps demarcating the 12 major dialect 
areas and 32 sub dialect areas and the various isoglosses along with a 
map showing the 9 districts of Kerala and an appendix of the 
questionnaire with 260 items was printed in 1974 by the Kerala 
University which is out of stock and the International School of 
Dravidian Linguistics will reprint it shortly as it is complementary to the 
published Dialect Maps of Malayalam.” | 


Nair Dialect Survey 


The Nair dialect Survey as an extension of the Ezhava/Tiiya 
dialect survey began in mid July 1968. One of the major goals of the 
Nair dialect survey was to check whether the 12 dialect areas of the 
Ezhava/Tiiya dialect coincides or not with the Nairs, besides to locate 
the caste bound isoglosses, the difference between the two castes with 
reference to areas, isoglosses etc. The same procedure of collecting 
materials from 12 points already identified, preparing detailed 
grammars, contrasting them to note down similarities and differences 
was followed, a dialect dictionary of 3000 items compiled and a 
questionnaire prepared with additional 43 items, besides those included 
for the previous survey comprising altogether 303 items. The final field 
survey at 921 panchayats to fill up the questionnaire had been 
conducted, the items indexed and 308 dialect maps were prepared. 
Frequency charts showing the comparison of Ezhava/Nair dialects were 
also prepared by using the material of the previous survey. The maps 
drawn show that the boundaries of the Ezhava/Tiiya and Nair dialects 
were not the same. They differ in the border areas within the centre of 


3. ‘The Second Edition of the report of the Dialect Survey of Malayalam: Ezhava / 
Tiiya was published by |.S.D.L. in 2007. 
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influence different for each community. The regional variations are not 
always shared by the two castes. This is only a random inference. The 
isogloss maps when dawn completely would give a clear picture. 


Harijan Dialect Survey 


Finding the tentative results of the Nair dialect survey as most 
productive and stimulating, the survey was extended to the Harjjans for 
a period of three years, because the isoglosses marked by the earlier 
castes had to be confirmed, besides this, the amount of Sanskrit 
influence in the dialects needed to be looked into as one of the major 
aims of the Harijan dialect survey. 


R.V.K. Thampuran, A.P. Andrewskutty and I stayed at 
Mannarghat, Tamarasserry and Tellicherry respectively for about three 
months, collected materials for the Harijan dialect survey and made an 
analysis of the PaRaya dialect and prepared preliminary reports 
showing the characteristic features. 


A few characteristic features of the PaRaya sub caste of the 
Harijan community may be noted. 


A regular dropping of almost all word initial consonants p, t, c, k, 
h, v, n n is a characteristic feature of this dialect whereas only the 
dropping of h in such words as haaram > aaram ‘garland’ haajaru > 
aajaru ‘presence’ is noted in the other dialects viz. Ezhava/Tuya, Nair 
and not any of the other consonants in this position. 


E.g.: peeti > ceti ‘fear’ betakkaaya > etakkaya ‘bad’, toolu > oolu 
‘skin’, ceecci > eecci ‘elder sister’, kaaRRu > aaRRu ‘wind’, veralu > 
eralu ‘finger’, haan > aanu ‘T’, nennhu > innu ‘chest’. 


Occurrence of nasalized vowels: E.g.: 1 < muinu ‘fish’, itu < miitu 
‘face’, dle < tavala ‘frog’, utte < muutta ‘father’s mother’. 


The occurrence of pronouns: atoon ‘he’ atalu ‘she’. The corresponding 
forms for these in other dialects are avan, aval. 


Peculiar lexical items like iippalu ‘below’ veere ‘blood’, olappe 
‘flesh’, veyyi ‘cow’, aampu ‘bedroom’. 


The PaRayas of Hosdurg use the present tense markers -anRa 
and -unta. E.g.: kottanRa ‘chops’, itunta ‘places’. The present tense 
markers in other gee of Malayalam are kottunnu ‘chops’, itunnu / 
itunu ‘places’. , 
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In Malayalam, the reflexes and substitutes of 1 to a great extent 
demarcate the caste/communal and regional dialect boundaries. It’s 
reflexes as observed in various dialect Malayalam especially with those 
of the Ezhava/Tiiya dialects (Subramoniam. 1974) spoken throughout 
Kerala are noted: |, y, t, O, yj, yy, jj, V, 2, 2g, $, §, h, I. 


1 is retained in the literary dialect and the standard colloquial 
speech of Malayalam. It occurs in all districts of Kerala. For instance, in 
such words as kooli ‘hen’, vali ‘path’, kelakku ‘east’, kelannu ‘tuber’, 
pulu ‘worm’, kaluttu ‘neck’, kalukkoolu ‘bamboo pole’, kolu ‘blade of a 
plough’, kaluta ‘donkey’, taale ‘below’. 


y as a reflex occurs in the Harijan and Muslim dialects besides 
the uneducated Ezhava and Nair dialects of the erstwhile Malabar 
region, predominantly in the northern districts of Kerala viz. Kannur, 
Kozhikkode, Malappuram and Palghat. E.g.: 1 is replaced by y in all the 
above items as kooyi ‘hen’, vayi ‘path’ etc. 


t occurs in a few instances in the Tuya dialect spoken in the 
northern most villages of Kasargod in Kannur district, which is one of 
the marked features that separate it as a distinct dialect area. It may be 
a phonemic replacement rather than a regular reflex, as it is not found 
in any of the Dravidian languages and dialects investigated so far. E.g.: 
ketakku ‘east’ ketannu ‘tuber’, putu ‘worm’ etc. 


® loss of 1 with vowel lengthening in verbs before past tense 
morpheme -nn-, such as viinu < vilunnu ‘fell’, taanu < taalnnu ‘down’ 
in most of the dialects. It also occurs in nouns such as keennu ‘tuber’ 
keekku ‘east’ mainly in Kannur and Kozhikkode districts. 


yj, YY, J} Which are peculiar to Muslim dialect is also noticed less 
frequently in the Ezhava/Tuya dialects of Malabar due to contact with 
the former. E.g.: puyju / puyyu / puyju ‘worm’. 


v occurs in Kannur, Kozhikkode and Palghat. E.g.: kovu ‘blade 
of a plough’, kavuta ‘donkey’, kavuttu ‘neck’. 


g, gg occurs in Kozhikkode and Palghat. E.g.: kogu/koggu ‘blade 
of a plough’. 


§ occurs sporadically in Palghat. E.g.: koosi ‘hen’, pusu ‘worm’. 
§ occurs in Kannur and Kozhikkode. E.g.: pusu ‘worm’. 


h occurs in Kozhikkode. E.g.: kahuttu ‘neck’, kahukkoolu 
‘bamboo pole’. , 
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| occurs sporadically in Trivandrum due to Tamil influence in 
such instances as kooli ‘hen’, kal uta ‘donkey’, kel annu ‘tuber’. 


It should be noted that the dialect studies and surveys conducted 
so far in Kerala in the Malayalam language situation especially with 
reference to caste and region show that all caste differences cannot be 
explained by two or three fold division such as Brahmin: non-Brahmuin, 
or Brahmin: Non-Brahmin: Harijan etc. Each community, caste and 
even sub-castes within the caste groups have to be studied 
independently and exhaustively. For instance, the Non-Brahmins 
include such castes, as Nairs, Ezhavas, Harijans and the Harijans again 
comprise of such subcastes as Pulayas and PaRayas. We cannot say that 
all these castes represent the same dialectal features; there 1s 
considerable difference among the dialects of these groups. 


Regarding the regional variations, extreme differences are 
observed in the various levels of linguistic structure including the 
intonation patterns of the speech of the same caste group or different 
groups living in two different areas, for instance, the Malayalam speech 
of any group of the northern region vs. that of the southern region 
which can only be attributed to the region since these features are 
almost uniform for any group of the respective regions. Regarding 
lexical items there are many with purely regional connotations, the 
same forms in two areas have two different meanings and also there are 
forms which are taboos in one region is not so in another region. 
Similarly certain verbs and nouns have co-occurrence restrictions at 
regions. Within a state, language and a community the speech forms 
show considerable differences especially when used in_ large 
geographical area and unless they are systematically surveyed many 
aspects of the language would not be known even to the native 
speakers. I was unaware of the numerous variations at phonological, 
grammatical, lexical and semantic levels with regard to several words 
prior to my field work where all these are directly observed - a 
tremendous first hand experience. There are different styles of 
Malayalam speech within Kerala and it would be interesting to study 
what factors influenced those differences. 


With the spread of education many people were trying to hide 
their regional or caste/communal identities by using standard forms as 
well as English words and thus a new dialect is emerging which could 
be called as standard colloquial Malayalam. 
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Appendix 1 
Front Page of the Questionnaire 


Investigator Date 


UNIVERSITY OF KERALA 
DEPARTMENT OF LINGUISTICS 


SMALL-SCALE DIALECT SURVEY 


QUESTIONNAIRE 


Preliminary Details and Instructions 


District Panchayat Village 

Name of Age Occupation 

Informant 

How long is he Birthplace If married, 

a resident there? where is his 
wife from? 

His travel Other communities Educational 


experiences in the locality background 
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Appendix 2 
Model of the Inner Page of the Questionnaire 


QUESTIONNAIRE 


I. What is the name you use for: 


1. ink maési — 
masi aA 
2. hen kooyi — 
kooli = 
3. water b/vellam = 
vellam — 
4. well done ceelu aayi — 
Seelu aayi —_ 
5. father is there accan avitintu 


accan avite untu —_ 


accan avitontu or 


6. groundnut oil kataleenna .s 
katalenna — 


—7.bed kitakka =o 
pes kitaykka — 
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Appendix 3 
Model of the Frequency Chart showing 
Variants for Item 1. Ink, 2. Hen 


FREQUENCY CHART OF VARIANT FORMS 


Districts with Total Number of Panchayats 


Items & GC Pr ape ek OTe 
Variations 114 147 114 89 101 89 96 96 75 
1. Ink 

masl - 49 2 Zz - jae <— - 
masi 203, 42 5.20% 867 10-87 763 Sl 68 
masi Sl . 99°, 9 1 : 2 a ee, 

magi - «92 he > 7 m= 
(retroflection) 

masu 2 1 - — > Hs 7 - 
maci 1 - - -. - = fey 
mali ee ee. BAD ee 
mayl - win Zh - = - —f = - 
saayl 10 - - . 4 - a. 
2. Hen 

kooli 6 7h UT BT TG ee 9G «206 74 
kooyi 108. W121 14. <2 . i. 
kooli - - . tes ‘elie oes | 
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Appendix 4 
The Base Map of One District Palghat showing the Panchayats 


4 


Palghat District 
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Appendix 5 
An Isogloss Map showing South Travancore Dialect Area 


[Thick line indicates bundle of isoglosses separating dialect 
boundaries; Dotted line shows district boundaries] 


Outlon 


lt | eae 


sor a 
fr Trvandrum 
B - Bundle of isoglosses a 
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Appendix 6 F 
A Map of Kerala showing 12 Dialect Areas of Ezhava/Tiiya Dialect 


[Thick line shows dialect boundaries; Dotted line shows district 
boundaries] 
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=. A DESCRIPTIVE ANALYSIS OF A DIALECT OF TAMIL !5 
=, V.I. Subramoniam, 2003, HB, Demy 1/8, pp. xiv+85, Rs. 170/- (US$ 17/-) = 


= The thesis used only by researchers in the Kerala : 


=| University and later in the International School of ‘= 
=, Dravidian Linguistics, has come out now in attractive i= - 
=, format. When voicing of the written script is now attempted ‘= 
=, in several centres, its acoustic study helps to voice the Tamil ‘= 


=: sounds. A pathfinding thesis completed in 1957. - 
= _A HANDBOOK OF TAMIL NADU : 
=, K.M. Venkataramaiah (Ed.), 1996, HB, Demy 1/4, pp. vit556+xiv, '5 
=; Rs. 1,000/- (US$ 125/-) : 
“{ Arranged alphabetically, this book covers most of the 


=: details on history, culture, language and literature, Saivism, ‘= 
=, Waishnavism, Buddhism, Jainism, Christianity and Islam, i= 
=, and has pen-pictures of important personalities. Running to t= 
=, Over 576 pages, with an appendix and index, it is intended t= 
=, not only for researchers as a ready reference but also as one {a 


=t catering to the needs of the common readers to know the = 
=" land, people, history, culture, language and literature, and = 
=} religion of Tamil Nadu through the ages. f 
S| BA:LAVYA:KARANAMU OF PARAVASTU CINNAYA SU:RI_ = 
~ P.S. Subrahmanyam (Tr.), 2002, HB, pp. iv+xliv+382, — 
=| Rs. 1,200/- (US$ 120/-) - 
= This magnificently-produced volume in 10 chapters 7 
=" with 3 appendices opens our eyes to Telugu grammatical 5 
=' tradition. Though Panini is mentioned, the Katantra pattern - 
=! which is also the pattern of Tolkappiyam, is followed. For ‘= 
+t describing the grammatical tradition of Telugu, this work is i‘ 
= indispensable. The English translation is pleasing and 3 
I precise. — E 
satin Qe Ay aR RAREST DEI 5. LOPS EY Sop ir 


OTOL 


International Journal of Dravidian Linguistics 
Vol. XXXVII No. 1 January 2008 


Notes and Discussions 


ORIGIN OF THE DRAVIDIAN UNIVERSITY. 
VI. SUBRAMONIAM 


A call in the early morning of 19th August 2002 from Vadasery 
to Thiruvananthapuram conveying the telephonic message that S. 
Kasipandian expired and the last rights will be performed at 2 p.m. on 
that day. | 


The news was shocking though it was not unexpected because of 
S. Kasipandian’s failing health during his stay in Porur, Chennai. 


Soon my brother, A. Raghavan, phoned up to convey the same 
message. I requested him to be present during the performance of the 
last rites and place a wreath on our behalf. The quickest means to reach 
Madras will be a flight but that starts from Thiruvananthapuram only at 
5 p.m. to reach at 8 p.m. on that day. My brother offered to send his car 
to take me to the house of Kasipandian at Porur from the airport. 
Sadness engulfed me throughout the day. 


After forty minutes, I was to leave the house of Kasi at Porur. My 
eyes were full of tears. The voice was thick. It was that of a weeping 
soul. The daughter was called by Umashankar. Mrs. Kasi was in a side 
corridor. The voice of sorrow was audible and moved any listener. I left 
the house with wet eyes and a choking voice. 


In the night I could not sleep because of the sadness. So I 
decided to take a bus back to Nagercoil cancelling my reservation in the 
Anna University guesthouse. The pangs of sorrow are much more 
painful than the physical discomfort of the bus travel for fifteen hours. 


* From Amarar Kasipandian, C. Umashankar (Editor), Chennai. 
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A saga of friendship with Kasi had ended. 


Kasipandian whom I called when he became intimate as ‘Kas?’ 
was an I.A.S. officer. He was a student of the St. Xavier’s College, 
Palayamcottai and had respect for his Tamil Professor, A. Arulappan, 
for whom I had regard and later Arulappan became a member of my 
family. Kasi had his Law course at Chennai and had a stint of a year at 
Cambridge when he was Secretary to the Government of Andhra 
Pradesh. His love for Tamil and Telugu and for Dravidian culture was 
infectious. 


I was associated with the Planning Board of the Tclugu 
University, Hyderabad when I was the Vice-Chancellor of the Tamil 
University, Tanjavur. This position was continued even after my 
dissociation with the Tamil University. One: day when I attended the 
meeting of the Planning Board, N. Sivarama Murthy who served as a 
lecturer in the Department of Linguistics at Kerala, and shifted to the 
International Telugu Institute under the Telugu University, mentioned 
about Kasipandian, the Secretary for Education in the Government of 
Andhra Pradesh and said that he was very helpful to the Telugu 
University for its growth and in the allotment of funds; more than what 
the University had asked for. An appointment was fixed almost within 
hours and when Sivarama Murthy and I went to his chamber, a call 
came to meet the Chief Secretary and Kasi replied that he will come 
after five minutes. So hardly four minutes were available for introducing 
my subject. The latest number of the /nternational Journal of Dravidian 
Linguistics was presented to him. On turning its pages and with glee and 
shine in his eyes, he said that such ventures should be supported by any 
government. We left his room so that he may walk through the narrow 
corridor of the Secretariat to meet the Chief Secretary. 


On my return to the Telugu University, I recalled my efforts 
almost from 1977 meeting the Minister for Education, the Secretary for 
Education in the Government of Andhra Pradesh and very seldom the 
Chief Minister of Andhra Pradesh who rarely gives appointments to 
unknown faces like me, to realize the ambition of bringing together the 
four major languages of the Dravidian family and then all the tribal 
languages of that family. In some States, especially in Tamil Nadu, after 
some persuasion and securing the support of some leading political 
leaders like K. Anbazhagan and V.R. Nedunchezhiyan, I was able to 
secure a modest grant of 1.35 lakhs yearly for five years and 4 lakhs for 
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building. Karnataka also agreed due to the influence of P. Nedumaran 
who was a member of the Working Committee of the Indian National 
Congress by speaking to the Chief Minister, Sri. Devaraj Urs. Kerala 
came later after watching the progress of activities of the International 
School of Dravidian Linguistics but it gave a grant of 5 lakhs for 
purchasing a plot of land of 27 acres in the St. Xavier’s College campus 
for 6.25 lakhs. The balance in the price and the registration charges 
were met by the Dravidian Linguistics Association. So tackling the 
Andhra Pradesh Government became important. My effort to create 
interest in the Central Government was not fruitful. Kasi’s help in the 
active participation of Andhra Pradesh was immense. He knows how to 
succeed in securing sanction for any proposal from the Minister 
concerned and the Chief Minister, through his notes and his effort to 
explain the project of common interest to the friends and admirers of 
the Chief Minister who have an inclination for the implementation of 
such projects. All knew his honesty, devotion to work and love for 
Telugu and his care for the achievements of the Chief Minister under 
whose Ministry he was serving. All these and another factor that he was 
a member of the Scheduled Caste made him a favourite of ministers 
and chief ministers. The Congress was strong in Andhra Pradesh when 
I tried my luck initially. There were pleasing assurances but no yield 
came in the form of grants. I had a false belief that the intrinsic merit of 
any proposal will elicit grants. Soon I was made to realize that it is the 
person behind the proposal and the gains for the Ministers from the 
proposal that matter for sanctioning. Kasi knew his pathways for getting 
any good proposal sanctioned. 


Initially, my effort was to secure the grant of the Government of 
Andhra Pradesh for a building in the name of Nannaya. Through the 
Telugu University, he sanctioned about 6 lakhs for a Nannaya Block. To 
get the money was not easy, even after securing the Order of 
sanctioning the grant. The Personal Secretary to N.T. Rama Rao was a 
Scheduled Caste Officer. So Kasi’s files passed through in that office 
quickly. While sanctioning, N.T.R. seemed to have observed "why 
should you give money to distant places" but sanctioned the grant. Kasi 
later informed me this when I met him. 


The Telugu University had a Registrar who was a member of the 
Scheduled Caste. He was reverted to the Education Department. 
Before relinquishing office, he released the first instalment of 2.5 lakhs. 
The balance grant of 3.5 lakhs was later released by the Andhra 
Government because of the help of Kasi. 


ij! Vol. XX XVII No. 1 194 Subramoniam 


Meanwhile, Kasi advised me to have a function for laying the 
foundation stone in the I.S.D.L. with the Minister for Education, Sri. 
Muddu Krishnamma Naidu, on 26th December 1986. It was a rainy day 
and none can predict the climate in Thiruvananthapuram. Yet the 
function went on well. The progress of building work was swift. The 
building was planned to house the researchers living with families and it 
is the largest building in the campus. When it was completed, Kasi 
advised to convene a seminar on Dravidian Philosophy along with the 
inauguration of the Nannaya block on 19th November 1987. The 
Speaker of Andhra Pradesh Assembly, Sri. Narayana Rao and the 
Minister for Education, Indira Reddy, came for the function. Kasi was 
present in the two functions. Even a brief mention of his help in 
securing the sanction of the Government of Andhra Pradesh did not get 
his approval but I did not fail in my duty to express my gratitude. 


By that time, Kasi succeeded in organizing the Dravidian 
Philosophital Research Institute (DPRI) at Tarnaka. N.T.R., the Chief 
Minister, had a liking for Justice Avula Sambasiva Rao who was a 
former Chief Justice of Andhra Pradesh. He was made the President of 
the DPRI. A library was organized. A modest grant was secured 
through the efforts of Kasi from the Government of Andhra Pradesh. A 
seminar on Dravidian Philosophy giving importance to Vamana’s 
Philosophy and Saiva Siddhanta was organized and Professor K. 
Anbazhagan who was then the Minister for Education in Tamil Nadu, 
inaugurated. The then Chief Minister, Vijaya Bhaskara Reddy, could 
not come but sent his Minister for Education. The seminar attracted 
attention and the DPRI. A corpus of about a lakh of Rupees and a 
library of thousand books were collected and later when the Dravidian 
University began its operation, both were handed over to that 
institution at Kuppam. N. Sivarama Murthy and later K. Ramakrishna 
Reddy, both of the Telugu University, were associated with the DPRI. 


Simultaneously, the work for the proposal for the organization of 
the Dravidian University was actively pursued. A detailed write-up was 
already published in the volume of the Felicitation presented on the 
Seventieth Year to Professor K. Anbazhagan. Kasi was a Secretary for 
Panchayat Raj and Municipal Administration. On one evening, in his’ 
chamber, this idea was proposed and I was delighted and took it 
seriously. He had arranged for-a visit to Chennai and I was contacted by 
phone to join him to go to Kuppam to select a site. A multilingual area, 

especially in the border districts of Andhra Pradesh or Tamil Nadu, 
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was searched for as a venue for the University. I rushed to Chennai and 
met him in the Government Guesthouse there and we planned to go to 
Kuppam on the next day by the Brindavan Express. I purchased a 
second-class ticket for him also but he had already purchased one and 
was seated when I reached the platform. I could not return the ticket to 
collect the fare because of the limited time. So I gave my address and 
the ticket to collect the money and sent it by Money Order with a 
commission to a man who was a stranger but stood in the queue. 
Neither the money was returned nor did he write any note to 
Thiruvananthapuram. 


The setting up of the Dravidian University was the major part of 
discussion in the train. We reached Kuppam by 12 p.m. A collection of 
officers welcomed him and we all went to see the various sites available 
as purambokku land. One at Kudippally, which might have once been a 
Jain centre, was seen. A Malayali chainman was helping in measuring 
the land. Andhras, Tamils and Kannadigas were there in the group. The 
extensive land had rocks. I climbed up but Kasi did it slowly. The 
golden rock of Kolar was visible. We moved on to another site which is 
in the interior forest. To get that land needed the permission of the 
Government at the Centre because it is forest-land. That we suspected 
will not be easy. So broadly the Kudippally land was decided for setting 
up the University. We both returned to Madras. He took the 
Hyderabad Express and I returned to Thiruvananthapuram on the next 
. day. 


A Committee was formed with Justice Avula Sambasiva Rao as 
the Chairman, K. Mohanarangan (former Director of Collegiate 
Education of Tamil Nadu), Professor S. Shantha Veerappa (the former 
Principal of the Maharaja’s College, Mysore) as members and I was its 
Member-Secretary. A meeting was arranged in the Srikumara Kriba 
Guesthouse at Bangalore. All expenses for travel, stay, etc. were met by 
I.S.D.L. Kasi had his train ticket purchased by him. We both stayed in 
the same room in a hotel. Only the Chief Justice was treated as a State 
Guest. The draft statutes of the Dravidian University brought by me 
from Thiruvananthapuram after getting it typed by my daughter and the 
typist of I.S.D.L., Mrs. Rethi, was scrutinized clause by clause. The 
draft was finalized after reading closely the Model Act of the U.G.C. 
and the Tamil University Act, which were available with me. On the 
next day, we all visited Kuppam in a taxi from Bangalore. The Justice 
was received with a garland and a cheering crowd. Kuppam was 
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predominantly a Naidu-dominated area. N.T.R. took interest because 
of this fact. Justice Avula Sambasiva Rao, who was visibly pleased, 
listed the points which are advantageous for setting up of the 
University, like rainfall, facilities for travel, availability for land, etc. The 
Draft Act indicated the sharing of the expenses in setting up of the 
Dravidian University by the four Southern States and by rotation of the 
Senior Officer like Vice-Chancellor, Finance Officer, Registrar, etc. 
among the four states. A letter for forwarding to the Chief Minister of 
Andhra Pradesh was drafted very attractively by Kasi and signed by all 
members. It was an important meeting. After that, the Justice was sent 
by plane to Hyderabad, others by train and I left in the evening for 
Thiruvananthapuram. Kasi’s ability to propose a project and get that 
implemented was very obvious. 


Only a few in Hyderabad will recall that Kasi was responsible for 
installing the statue of Mother Telugu Telugu Toll. Even the names 
‘Telugu Desam’ for N.T.R.’s party and ‘Mahaa Naaduw’ were suggested 
to N.T.R. by Kasi. No wonder that N.T.R. treated Kasi with affection 
and regard. 


Soon N.T.R. was prevailed upon by Kasi to invite all members 
for a discussion. The Chief Minister invited all members for a breakfast 
in his residence. The Chief Secretary, Education Secretary and other 
important officers including Kasi were invited. As a host, N.T.R. was 
superb. I was asked to explain. I did it stressing the cultural unity and 
the co-operation between the Southern States. N.T.R. seemed to accept 
the proposal and asked to formulate the steps to implement the 
proposal to Justice Sambasiva Rao. 


Soon the Council of Direction of the IL.S.D.L. proposed all 
members of the Drafting Committee to be members of the 
Implementation Committee. The Committee met at Hyderabad and 
proposed that: 


(a) the budget be shared by the four Southern States and the Centre, 
(b) the top posts should be shared by the four States, 


(c) all policy-making bodies should have representation from the four 
States and other funding bodies, 


(d) the land of 1,090 acres be provided free of cost by the Government 
of Andhra Pradesh to the Dravidian University. 
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Two events which were'‘not conducive for the implementation of 
the Dravidian University proposal took place after this. In the gencral 
election, the Telugu Desam party was reduced to a minority and the 
Congress came to power. Soon Kasi took up a central-government job 
in the Food Corporation of India and was posted in Chennai. The 
Secretaries at Hyderabad who were opposed to the Dravidian 
University proposal felt that it was buried with the departure of Kasi. 
They all were enthusiastic in the formation of a Sanskrit University in 
each State including Andhra Pradesh. 


The Dravidian University proposal secured support from 
unexpected quarters due to the efforts of the I.S.D.L. The I.S.D.L. for 
the Chairmanship of its Governing Council invited Dr. B. 
Ramachandra Rao who was the Vice Chairman of the U.G.C. whom I 
know a bit. He readily accepted the invitation. In the Annual 
Conference of the D.L.A. at Udupi, I briefed him about our attempts to 
set up a Dravidian University at Kuppam and how it had not 
materialised. The then-Minister for Human Resource Development 
was Sri. Arjun Singh whose close ally was Sri. Ajit Jogi. Dr. 
Ramachandra Rao was a friend of him. So Dr. Rao spoke to Sri. Ajit 
Jogi about the unity move in the South by setting up a Dravidian 
University and the four States financing it. Jogi discussed with Arjun 
Singh who said when he visits Thiruvananthapuram within a week he 
will visit the I.8.D.L. So Dr. Ramachandra Rao wanted me to organize 
a function in the I.S.D.L. and Arjun Singh was requested to maugurate 
the Hospital Building named after Kerala Paanini, for studying the 
mentally-retarded children. He readily agreed. Hon. K. Karunakaran 
was invited but he could not attend. After the meeting, which went on 
well, in which Arjun Singh, said that I.S.D.L. should have advertised 
more its activities, he was led to an air-conditioned room in the 
computer section where he said that such an institute is necessary and 
that it should be an institute like the Tata Institute of Fundamental 
Research at Bangalore. It may be registered and he will write to all 
Chief Ministers in the Southern States to fund it and the Department of 
Education will give an equal amount. A friend of Kasi is Dr. S. 
Chellappa I.A.S., who was in the Government of Andhra Pradesh. He 
helped in registering the Institute at Hyderabad and Sri. Arjun Singh 
was informed. He wrote to the Chief Ministers in the four States. He 

et Jayalalitha at Delhi in a lunch and explained to her the good move 
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and she agreed to fund the institute. However after returning to Madras 
she wrote a negative reply. Undaunted, we tried for a University or for 
an Institute for Dravidian. Kasi at Madras was informed about the 
developments then and there. 


At Delhi, the Ministry of Education had a Secretary from Tamil 
Nadu who would keep the file delayed in some pretext or other. Dr. 
Ramachandra Rao once told me that Arjun Singh was so irritated that 
he told the Secretary to write whatever he wants but send the file to 
him. Dr. Ramachandra Rao did not share with me my interpretation 
that the Brahmin administrators at Delhi are determined to kill the 
proposal. 


Soon the file was referred to the U.G.C. to examine the draft 
report about the feasibility of the Dravidian Institute and its financial 
layout. Though all those points were clearly stated in the introduction to 
the draft of the Institute, the Secretary wanted the U.G.C.’s opinion. 


The Chairman at that tme was Dr. Ram Reddy who was known 
to Dr. Ramachandra Rao intimately. So I invited the Chairman of the 
Committee named by the U.G.C., Dr. M.V. Pylee, to visit the I.S.D.L. at 
Thiruvananthapuram after visiting Kuppam. The Committee’s date of 
arrival at Thiruvananthapuram was notified to Kasi who was then a 
member of the Administrative Tribunal at Kochi. He also came on that 
day. The secretary for the Visiting Committee was Dr. Gangurde who 
was a member of the Scheduled Caste. So Kasi’s words carried with 
him. The Committee gave a good report but Kasi could not get a copy 
of the report. I had to visit Delhi for the meeting of the Jnanapith 
Central Committee and visited Gangurde who gave a copy, the only 
copy with him to read. I took a xerox copy and gave his copy to 
Gangurde. Later, Kasi was informed. Soon Kasi was appointed as the 
State Election Commissioner in Andhra Pradesh. He asked my opinion 
for accepting it and I said if he is independent of the Central 
Commission, please accept that. He moved to Hyderabad. 


Janardhana Reddy was the Chief Minister and Kasi was active. 
The Committee under Justice Avula Sambasiva Rao and B. 
Ramachandra Rao and I along with Kasi met the Chief Minister. He 
asked why the University is to be started in Andhra Pradesh. As answer 
to the unexpected question, in my own way I pointed out the great 
Amaravathy culture of the Buddhists, migration to the Far East, etc. 
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from Andhra Pradesh. A press note was issued by Janardhana Reddy’s 
office that 1,090 acres and twenty crores were provided for the 
Dravidian University but the Order was not issued in spite of Kasi’s 
persuasion. 


Soon, the Assembly elections reinstated N.T.R. Kasi had 
enormous influence with N.T.R. He arranged for the Committee to 
meet in the chamber of N.T-.R. in his house. N.T.R. enthusiastically said 
that he would provide 170 crores for the Dravidian University and the 
Ordinance for the University will be issued soon. A hand of N.T.R. was 
paralysed. Seeing this, I left the bulky volume of the Dravidian 
Encyclopaedia to be presented to him, on the table and he was visibly 
moved. A news conference was arranged. I was asked to sit by his side. 
However, Kasi went away as soon as the discussion was over. Muddu 
Krishnamma Naidu, the Minister for Education, asked me to go with 
him in the car to his office. In the car, he asked me to take up the 
Vice-Chancellorship of the Dravidian University. I with folded hands 
said, "please nominate someone else". The bodyguard in the front seat 
laughed in amazement by my reply. Later I told Kasi in his office about 
this. He told me it is to outwit Chandrababu Naidu who will nominate 
his own candidate. I have no knowledge of the crosscurrents of politics 
in Andhra Pradesh whereas Kasi was well-informed and effectively 
handled them. 


Soon N.T.R. was replaced by Chandrababu Naidu. Kasi was 
again in the picture to persuade Naidu to implement the Dravidian 
University proposal at Kuppam from where he was elected. Two years 
before that, the Annual Conference of Dravidian Linguists was held at 
Kuppam in the compound of an elderly Naidu who welcomed our 
efforts in setting up the University. N. Chandrababu Naidu was invited 
to speak at the instance of Kasi though the Congress Minister for 
Education was to preside. Kasi had the foresight to carry through the 
proposal by roping in people who matter now and later. 


N. Chandrababu Naidu, when he became the Chief Minister, 
called me and P.V. Arunachalam who was introduced as a poet in 
Telugu by Kasi in one of my visits to Kuppam and asked me my opinion 
about him. I said I may agree if Kasi knows him well. In the meeting at 
Tirupati, the steps taken by Tamil Nadu to implement the Tamil 
University by issuing an Ordinance was mentioned to Chandrababu 
Naidu. The Draft University Act as approved by the Expert Committee 
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headed by Justice Avula Sambasiva Rao was also mentioned. It was 
indicated that an ordinance will be issued for the Dravidian University. 
I came away from Tirupati by the night bus to Madras. , 


Soon, the Dravidian University was passed by the Legislative 
Assembly. The Governor, Krishna Kant, delayed its approval by raising 
objections. In the Annual Conference at Madras, M. Karunanidhi who 
presided over observed that Krishna Kant who is a friend of his will 
fulfil the aspiration of the people of the South by assenting to the Act. 
Soon, Krishna Kant agreed to the Act. My name appeared as 
Pro-Chancellor and P.V. Arunachalam as its Vice-Chancellor. A major 
part of the credit for setting up of the Dravidian University should go to 
Kasi who never claimed it but I never forgot it nor did I fail to mention 
this in appropriate forums. 


Kasi once told that when the Departments and Courses of Study 
were finalized in a meeting of the Syndicate / Academic Senate, the 
retired Brahmin officer met almost every evening and prevailed upon 
C. Rangarajan, the Governor, to negate its progressive programmes. In 
one instance, he asked the Vice-Chancellor, P.V. Arunachalam, to go 
the Chancellor and get a suggestion approved which Arunachalam did. 
Later Kasi told me that he had to interfere and said that he be excused 
for that. 


After his tenure as State Election Commissioner in Andhra 
Pradesh, Kasi moved to his house at Porur, Chennai. This departure 
from Hyderabad weakened my contact in the Secretariat of Andhra 
Pradesh. To reach the ears of the ministers in the best interest of the 
University became very difficult. C. Rangarajan was cautious in every 
act and would not offend anyone in Andhra Pradesh, especially N. 
Chandrababu Naidu. I had enjoyed the confidence of governors like 
Sadique Ali or Korana at Chennai or Kang in Kerala. I found the Raj 
Bhavan at Andhra Pradesh slightly formal and undependable. This was 
shared by Kasi. He soon felt that another university should be started in 
Tamil Nadu. He himself organized an association in the name of 
Vallalar and gave prizes of Rs. 25,000/- to two scholars: one to 
Gnanasambandham and another to me. The money 1s his gratuity he 
received after retirement. So I quietly returned it, appreciating his 
outlook to honour Tamil scholars. 


Kasi tried another project for helping scholars to appear for the 
L.A.S., LP.S., etc. from the rural areas. He wanted SC/ST students and 
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the backward-class young men to be trained for a mimimum fee to sit 
for the central examinations. He built a two-storied long school-like 
building in a quiet place at Shencottah on his wife’s property. My advice 
was not to go in for this venture, which in a rural area will be 
non-productive. He would not agree. I am to support him even when he 
disagreed with me. One who was principal in Hyderabad agreed to 
come from Hyderabad to head this school. Soon he backed out when 
the remuneration was not sizeable. To see that Kasi’s efforts are not 
failures, I travelled from Nanguneri where I go for village work on every 
Sunday to Shencottah and took classes. The questions in the LA\S. 
where ‘Yes’- / ‘No’-type and my lectures did not satisfy the students’ 
requirement. Kasi borrowed money, about six lakhs for the building. 
Whether he had repaid the borrowed money, I do not know. Later, the 
course was wound up. 


Another two events in which I was associated were the 
inauguration of his house and the marriage of his daughter, a 
well-behaved and affectionate girl, educated and modest. I happened to 
attend the house-warming ceremony of his new house in Porur on 
invitation. He asked me all of a sudden to open the door and enter the 
house as a first member. Normally if informed earlier, I might have 
declined and asked him to select an elder for this. Knowing me well and 
because of the respect and the affection for me, he asked me to open 
the door in the last moment so that I might not back out. 


For the marriage of his daughter with Mr. Umashankar who is in 
the administrative service, I was there with my two brothers, A. 
Raghavan and A. Mahadevan. All of a sudden, the bride and groom 
came to perform paada puja to me at the instance of Kasi. Embarrassed 
I underwent and blessed them. They remain my daughter and 
son-in-law even today. 


I entered the house of Kasi at Porur on the day he was buried. It 
might have been 9 p.m. The youngest brother led me in. His son-in-law 
who was looking at the office papers stopped his work and narrated the 
last days. Kasi needed medical attention. He loved his grandchildren. 
All were there when life ebbed out of his body. He said that his body 
should be buried in a Christian cemetery for which Umashankar 
obtained permission to bury in Kilpauk Garden cemetery. He remained 
a devote Hindu and got him buried in a Christian cemetery. Kasi lived 
for certain ideals. Impeccably honest in his work, he had unsuppressible 


ijd] Vol. XXXVII No. 1 202 Subramoniam 


love for Tamil and Telugu and the Dravidian culture. All his friends 
were and continue to be mine also. He was a brother while alive and 
still, even after his death. 


People who devote their life for the cause of language and 
culture of the South are few. Those who do that without publicity are 
fewer. Kasi did his best for the languages - Tamil, Telugu and for 
Dravidian - which very few know. Fewer still will be those who 
document those acts. I had done this as a matter of duty. 
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Notes and Discussions 


FOOD TALK IN ORIYA’ 


B.N. PATNAIK 
Central Institute of Indian Languages 
Mysore 


This paper deals with what may be called "food talk", and is 
organized in three parts: the first is devoted to food talk in the so-called 
standard dialect Oriya, which is the central concern of the paper, the 
second suggests that such talk may not be restricted to this language, © 
and the third offers a brief and tentative account of how part of food 
talk has the features that it has. 


I 


Food and food codes in the Indian context are among the more 
well-discussed topics in Indian social anthropology and folklore studies, 
A.K. Ramanujan’s being among the most insightful, as far as the latter 
is concerned. There is some interesting writing in Oriya language 
mainly by literary critics and folklorists on Oriya food and food codes, 
but to the best of our knowledge, food talk has not so far drawn the 
attention of the Oriya linguists, literary critics and scholars working on 
this language from other perspectives. Food talk uses the vocabulary of 
food, and is seemingly about food, but is really not about it; thus 
descriptive terms belonging to the discourse on food such as "cooked / 
non-cooked", "sacred / non-sacred", "polluted / non-polluted", or the 
traditional Indian saatwik / taamasik etc. do not apply to it. In 
conversation it performs certain functions, such as conversation starter / 


1. ‘My thanks are due to Atiur Rehman Khan for Bengali data, and Kannada data, 
G. Bharathi for Telugu data, and Vikas Kumar for Hindi data and for his helpful 
observations on an earlier draft of the paper. Thanks are also due to Yudhistira 
Parida and Subhra Sahu with whom the Oriya data were checked. 
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facilitator, marker of informality, even indicator of good will in certain 
contexts, etc. Food talk is part of the informal use of language; more so 
of the rural population than of the urban, which may be one reason why 
it has not been noted for study. It does not find mention in the 
collections of Oriya folk literature, but it may be because these are not 
among the stuff the folklorist is generally concerned with, such as 
proverbs, riddles, wisecracks, tales, songs, etc. These do not occur in 
folk tales too - one place where one would expect them to occur, these 
being part of ordinary conversation. Food talk has also escaped the 
attention of the Oriya socio-linguists. Careful study of discourse in 
Oriya is yet to be undertaken, which may constitute yet another reason 
why food talk has not drawn the attention of the scholars so far. 
Consider the following: 


(1) khaailani / khiaapiaa helaani? | 
eat-perf-2p / eating-drinking - be-perf - 3p 


(Have you eaten / Is dining over?) 


This is part of the linguistic etiquette system, called kushala 
pachaanibaa (asking about the other’s welfare) in Oriya; it’s a form of 
responding to a customary greeting or a part of conversation starting 
strategy. It can be used only after meal time, and would be 
inappropriate if used earlier, say, soon after sun rise, which is the time 
for worship, or to go to work, etc. "Meal time" is clearly not a fixed time 
concept; it is a matter, among others, of the economic condition of the 
family concerned, and the job requirements of a certain individual. The 
speaker of (1) is not really interested in or concerned about whether the 
hearer has dined. One advantage of (1) as conversation starter is its 
relative neutrality of tone; it isn’t too personal, yet it isn’t distant and 
formal.> It invites comparison with the polite, phatic "how-do-you-do" 


2. When not used in this way, the expression has only the literal sense. When 
spoken to a child, it is an attempt to make him (or her) feel good, and 
encourage him to talk to the speaker. In our villages, as children, we grew up 
with it fifty years ago, and even today almost the same situation obtains. 


3. Interestingly, eating in a traditional Indian - probably Hindu - thereby Oriya, 
household is a very personal and private activity; one ordinarily dines with the 
members of one’s family and presence of others while dining is in progress is 
not the norm. There are food offerings one must share with the members of 
one's family alone; whereas prasaada (food offered to gods) must not be eaten 
alone, and must be shared, the food offered to Lakshmi, goddess of prosperity 
and wealth, is not given to anyone outside the family. 
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or weather talk in English. As far as the conversation starter and 
greeting features are concerned, these are equivalents, but there is a 
difference; there is a fairly fixed response to the phatic expression, but 
it is not so as far as (1) is concerned. It is interpreted both as a greeting 
expression and as a question requiring an affirmative or a negative 
reply, and is ordinarily answered in terms of any of the variants of "yes" 
or "no" or "not yet", "going to", etc.; in other words, the response is not 
fixed, although some response is required and null response is 
unacceptable. Either as a conversation starter or as a continuation of 
(1), one could use (2): 


(2) kana tarakari heithila? 
What - curry - be-perf 


(What curry was made?) 

The hearer interprets it correctly as a form of greeting, and 
could respond by either giving the information asked for or giving an 
- appropniate non-committal and evasive reply. Both are considered 
acceptable responses, and either could give rise to some more talk. 
Worthy perhaps of attention is that the question is not framed about 
rice or pulses, which are part of a meal too, but about curry, which 
could be a cover term for a range of dishes - vegetarian, non vegetarian, 
fried, mashed - which need not even include what strictly speaking 1s 
curry - the preparation with rather thin gravy. It may be because in the 
traditional, ordinary, minimal Oriya meal at least in the villages, curry is 
the item that permits some variation, rice and black gram daal being the 
fixed ones. 


A few decades ago, when a person returned to his village after 
visiting some distant place, he was sometimes affectionately asked by 
the village elders about what that place was like with the following: 


(3) ..sethi kana dekhila? kana khaaila? 
there - what - saw-2p? what - saw-2p? 
(What did you see there? What did you eat?) 
Seeing and eating constituted a cover term for a variety of the 


person’s experiences in the place he had returned from. (3) was used 
somewhat more to express cordiality towards the person who had 
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returned home than to really know from him what all he had 
experienced. But literalness is not completely absent in (3); that is why 
a brief answer giving some information about the kinds of food 
available or not available, cost of food, etc. even in a casual manner was 
quite acceptable. 


It may be noted that food talk is conspicuous by its absence in 
literary texts, even in plays and works of fiction. There is a rare example 
of food talk in the form of (3) in the Oriya novel kanaa maamu (‘blind — 
uncle’). This occurs in a conversation between a Hindu young man and 
a Muslim elder who comes to visit this young man and his friends at 
their village gymnasium. The relation established between the young 
men and the visiting elder is like that between the grand children and 
the grand father from the mother’s side. This-is a relation where leg 
pulling and jokes are permitted, including the ones with mild sexual 
undertones. The visitor tells the young men about his passing through 
Lanka on his way to Mecca on pilgrimage. In order to impress them he 
tells them things that they know are clearly false, but he doesn’t know 
that they know. This creates a perfect situation for some pleasant leg 
pulling. One young man asks him what he ate there (sethi kana 
khaaila?). The objective is obviously not to find out what he had eaten, 
since they know that there was no truth in what they were being told; it 
is to please the visitor, make him feel good and encourage him to talk. 


The following occurs in informal interaction of both the rural 
and the urban population, especially the people who are not generally 
uncomfortable in hie: 


(4) kebe khaibaaku dauca? 
when - eat-non-finite-post position - give-prog-2p 


(When are you feeding us?) 


This is a friendly way of telling the hearer that something good is 
coming his way (he has got a job, an increment or a promotion, or 
found a girl. for himself, etc.), and of congratulating him for it. In some 
instances this is simultaneously a way of mildly complaining to the 
hearer that: although he did not tell the speaker the good news about 
himself, "the speaker knew of it. Thus in (4) eating is used not in its 
literal sense, but-in the sense of celebration. 
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In the coastal town of Puri, one occasionally comes across the 
use of (5) as part of informal and cordial interaction: 


(5) chenaa chakataa te hau* 
cheese - mash-nom - part - be-subjunctive” 


(Arrange a cheese party.)° 


Its conversational function is to bring some casualness into the 
interaction or relief in a rather not-so-pleasant conversation, as part of 
the effort to maintain cordial social relations. We are not sure whether 
such an expression is part of the local idiom of Puri and its 
neighbourhood alone. 


Such food metaphors as the following could perhaps be 
considered as instances of food talk. Consider (6) and (7), containing 
lexical couplets tela luna (‘ordinary, day-to-day’; literally ‘oil-salt’) and 
saaga bhaata (‘very ordinary food’; literally ‘green leaves-rice’) respectively: 


(6) taara tela lunara sansaara kaunasimate calijaauthilaa 
he -poss - oil-salt - world - somehow - going on-prog-past 


(He was somehow managing.) 


(7) se saaga bhaata khaai calijaauthilaa 
he - green leaves rice - eat-non-finite - go on-prog-past 


(He was managing with very ordinary stuff.) 


Very ordinary existence is expressed in terms of food. In saying 
(6) or (7) one is not talking about or inviting attention to food, but to 
the difficult situation of the referent. Now if the lexical couplets in the 
above are metaphors of deprivation, khin pun (‘pudding - puri’) or khiri 
khechadi (‘pudding rice cooked with black gram’) are metaphors of 


prosperity. 


4. Variations are possible; in place of chenaa chakataa, there could be 
cudaghasaa daalamaa (‘crushed, sweetened flattened rice, and daal mixed 
with vegetables’), etc. or a cover term such as khiaa piaa (‘party’): 


5. Te here is not the colloquial version of the classifier taa, bit a ‘sub-dialectal 
conversational particle that expresses informality. | Ye 


6. Chenaa cakataa is a local delicacy; sweets, bananas, etc. are’ @fushed with 
homemade cheese and served in lumps as sweet dish. . 
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The following, which is a proverb, might be viewed as an implicit 
food expression: 


(8) soilaa puara bhaaga naahin 
sleep-rel.part - son-poss - share - neg-be-prs 


(The sleeping child has no share.) 


The basic sense of bhaaga here is share in food; but it is not so in 
every instance. Sometimes the predicate, as in (9) and sometimes the 
verb phrase, not the predicate alone, as in (10), determines the sense of 
this nominal, as in: 


(9) hari taa bhaaga khaaidelaa 
Hari - he-poss - share - eat up- pst 


(Hari ate up his share.) 


(10) han bhaagare sahe aamba ghacha padilaa 
Hari - share-in - hundred - mango - trees - fall-pst 


(A hundred mango trees fell in Hari’s share.) 


Bhaaga remains ambiguous when the context does not 
contribute to the restriction of meaning. Consider (10): 


(11) hari taa bhaaga raamaku delaa 
Hari - he-poss - share - Ram-acc - give-pst 


(Hari gave his share to Ram.) 


Dissociated from the context, share in the above could refer to 
anything: from food to money or property. 


There is a code of food talk; it is an expression of power 
relations, as are other codes such as the address code or the dress code. 
One important aspect of food talk is that it is horizontally or downward 
oriented; the norm is that it is used among equals, or by the speaker 
who has a higher. social status vis-a-vis the hearer. Deviation is rather 
rare. Whenn an’ interaction the social superior uses any of (1) - (5), it 
is viewed*as. an ‘expression of good will and even a degree of cordiality 
for the hearer, r, although i in some context - say, when the hearer is fairly 


ijdl Vol. XXXVII No. 1 209 Patnaik 


highly educated and has a decent job - some hearer might regard it as 
condescending behaviour on the speaker’s part. But when the social 
superior is the recipient of such talk, it is viewed as unacceptable, 
something not done. A generous interpretation of it is that the speaker’s 
behaviour is rather cheeky. On occasions, for example, when the 
addressee’s status is very low with respect to the addressor’s, it can even 
be construed as rather rude behaviour. Similarly, when someone whose 
social status is very high in relation to that of the addressee says (2), let 
alone (4), to the latter, it is very condescending, in fact, crudely so. As 
far as (6) and (7) are concerned, these describe the situation of a 
certain person who is the subject of the talk, and these can be uttered 
even in his presence, and then it expresses the speaker’s sympathy for 
that person. When the speaker, who is well-to-do, utters versions of 
these with appropriate grammatical changes to talk about himself, these 
are regarded as politeness expressions, but when the speaker is poor, 
then these expressions sometimes tend to get interpreted as attempt by 
him to elicit the hearer’s sympathy. 


Il 


Food talk is not confined to Oriya. One finds instances of food 
talk in other Indian languages too, for example, Bengali, Hindi, Telugu 
and Kannada. 


Consider the following from standard Bengali: 


(12) khaawaa daawaa hoiche (kt) 
eating , etc.- has taken place - (Q word) 


(Have you had your meal?) 


(13) bhaat khiecho? 
rice - have eaten 


(Have you had your meal?) 


(14) aajke ki raandhnaa holo? i ies ea 
today - what - cooking - happened | “ . of 
(What has been cooked today?) | 
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(15) aaj ki khele? 
today - what - ate 
(What did you have today?) 


A word about the gloss: khaawaa daawaa in (12) is a lexical 
couplet, in which the second word does not have the meaning of "give" 
that it has when it occurs independently. In (13) bhaat is the cover term 
for the major meal, as it is in Oriya. In (14) the implicit reference is not 
to rice or daal, the minimal items, but to the curry, as in (15). More 
used in rural Bengal than urban Bengal, these "good will" expressions 
are often used by elders and social superiors when they respond to the 
greeting of the youngsters and the ones lower on the status hierarchy. 
When used:as such, they do not require literal interpretation by the 
addressee, who merely has to give some response, including a 
non-specific one. 


In Telugu, the following have the same function: 


(16) annam tinnaawaa? 
food - have eaten 


(Have you had your meal?)’ 


(17) troju mi inclo eem kura cesaaru? 
today - you - home - what - curry - made 


(What curry did you make at home today?) 
In Kannada, the following expressions of greeting: 


(18) utaa aaitaa? 
food - have eaten 


- (Have you had your meal?) 


(19) tindi / kaaphi aaitaa? 
snacks / coffee - have eaten 


(Have sig Hess your snacks / coffee?) 


a 


7. This is in Fapalaneetita Telugu; the following is its equivalent in the coastal 
_ dialect: bhoncesaaraa. 
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Among the urban educated, who are no strangers to each other, 
these are not used unidirectionally; a younger person can greet and 
initiate talk with an elder using the above. Around breakfast or teatime, 
one could use (19), and lunch / dinner-time, (18). 


As in Oriya, in Hindi, a comfortable life style is expressed in 
food terms. Consider the following, which belongs to the Kanauji 
dialect of Hindi: 


(20) we halwaa puri khaayaa rahe 
he - halwa puri - eating 


(He is having a great time.) 


In the same dialect the following constitutes an interesting 
response to the news that someone is. leading a prosperous life 
somewhere. The hearer does not believe it, and expresses his disbelief 
using (21): 


(21) huaan badi heshmi dahri hai 
there - a lot of - heshmi - kept - is 


(There is abundance of heshmi in that place.) 


Now heshmi appears to be some imaginary rich dish that 1s 
invoked by the elder generation to talk about something almost 
impossible. 


Ii 


Food talk is metaphorical, saying something in terms of 
something else, such as calling a crooked person jilebi (a syrupy sweet 
that has a crooked form), an easy task halwaa (a sweet dish), a boring 
person kadoo (pumpkin) in Hindi, or expressing good will by using 
language that relates to food in some way. As (1) and (2) and their 
cognates in other Indian languages would suggest, food talk is part of 
the repertoire of politeness expressions in these languages, which 
accounts for some of its crucial features such as its unidirectionality, 
and its sensitivity to power relations among the interactants. We might 
speculate why good will is expressed in food terms-in at least some of 
our languages. It is well known that in our tradition food enjoyed an 
eminent place since it was believed to be connected with not only the 
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physical but also the physical and spiritual well being of a person, and 
the importance of food in the day-to-day life of the poor villager 
especially in the feudal society and economy can hardly be 
over-stressed. It is presumably in this context that enquiring about the 
well being of a person came to be synonymous with asking whether he 
had eaten (1) or had eaten well (2) - eating a curry as part of the meal is 
more often than not a poor man’s luxury. Now if (1) and (2) are 
implicitly good will expressions, then one would reasonably expect the 
language to have explicit good will expressions too. Oriya indeed has 
the following which often function as alternatives to (1) and (2): 


(22) sabu bhala (ta)? / bhala acha? 
all - good - Q part / good - be-pr-2p 
(Is everything all right? / Are you fine?) 


To conclude, the paper draws attention to food talk in Oriya, 
which has remained unnoticed so far, by scholars working on this 
language. It deals, in some detail, with only those constructions that are 
greeting expressions. It maintains that food talk is by no means 
restricted to Oriya, and instances of it can be found as well in Bengali, 
Hindi, Telugu and Kannada, among other Indian languages, and 
speculates how some food talk expressions may have come to be used 
as expressions of greeting. 
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Notes and Discussions 


MOTHER-TONGUE INTERFERENCE IN THE ENGLISH 
VOWELS OF MALAYALI SPEAKERS IN ENGLISH 


RAJALAKSHMI V.R. 
S.N. College, Varkala 


Brian Harrison has clarified in English as a Second and Foreign 
Language that "The different speech sounds used in making distinctions 
of meaning in any language are the phonemes of that language" (20). In 
English we have the vowel sounds, diphthongs, consonants and 
semi-vowels. All these sounds together form the phonemic inventory of 
English. The phonemic inventory of one language with its set of 
contrastive speech sounds is always different from’ that of another 
language. A second-language learner has the tendency to adapt the 
system of the target language to suit that of his mother-tongue. As 
Robert Lado puts it: 


"When there is no phoneme in the native language that 
could be transferred to the foreign language and actually 
function as the phoneme in question, the student will not be able 
to produce that phoneme readily in learning the foreign 
language. He will substitute some other phoneme from his native 
stock" (13). 


In India, English is one of the two official languages and has an 
important place in the central and state legislatures, the judiciary, 
industry, commerce and trade besides the media, both print and 
electronic. The existent linguistic diversity of India is largely responsible 
for the growth of English as an important link language in the country, 
particularly among the educated. 


Over the years, English has penetrated deep into the Indian 
society creating a wide variety of variant forms. The development of 
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new varieties of English is linked to social and historical factors. In the 
Indian set-up, English language is reshaped by the socio-cultural setting 
of the particular area where the language is used. Thus a number of 
regional varieties of English have evolved, depending on_ the 
socio-linguistic peculiarities of the region. English has definitely 
acquired a certain regional flavour among most of the Indian speakers, 
which distinguishes it from the varieties of native English. R.K. Bansal 
points out: 


"Indian English as spoken by educated people in India 
does not differ radically from native English in grammar and 
vocabulary, but in pronunciation it is different from both British 
and American English and even within India there are a large 
number of regional varieties, each different from the other in 
certain ways and retaining to some extent the phonetic patterns 
of the Indian language spoken in that particular region" (1). 


English in India is gradually detaching itself from its native 
variety as it is happening with non-native varieties of English in other 
parts of the world. The vitality of English in a particular area depends 
on the necessity of acquiring enough proficiency in the language for 
communicating intelligibly among the bilinguals of that particular area. 
In a region where the inhabitants have no live contact with native 
English speakers, their oral and written expression of English will go 
through divergent development into a regional variety. 


Kerala is considered to be the first fully literate state in the 
country and has centuries-old history of fruitful interaction with foreign 
languages like Portuguese, French and English. Of all these foreign 
languages, it is English that enjoys the highest status and greatest 
demand in the state. Unemployment compels a great number of people 
of Kerala to leave their home-state in search of jobs. Against this 
background, English is seen as a passport to a job and survival outside 
the state. English has been the second language in Kerala ever since the 
introduction of English education in schools and colleges. Though 
English enjoys a great importance in the curriculum and a high status in 
the society, the standard of spoken English is low. Very often the 
pronunciation of an L2 learner is not sufficiently intelligible or 
acceptable. The teaching of English is often almost a_ bilingual 
explanation of textual material. Many teachers make use of the 
grammar and. translation methods while teaching English, either 
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because they are not convinced of the efficacy of the oral approach or 
are not well-equipped to practise it. In their article World Englishes, 
Braj B. Kachru and Cecil L. Nelson remark "in most cases it is taught to 
non-native speakers by non-native speakers, neither teachers nor 
students (who themselves become the next generation of teachers) ever 
having any contact with a native user" (18). 


Our teaching methods are such that comprehension of ideas has 
become our sole aim in learning English as a second language. In the 
earlier years of English education, the need was to teach the Indian 
learners to write English without mistakes so as to enable them to 
handle official communications in writing. The need to cultivate an 
acceptable standard of pronunciation and accent was not given due 
importance. The present-day jobs require good communication skills 
and listening skills. Comprehension skills have to be developed not only 
by studying from a written piece of work but also by listening to actual 
instances of speech. For this, the L2 learner must definitely be 
- conversant in the native pronunciation to which we so far have not given 
much importance. 


Now it is an accepted fact that we should begin to learn a foreign 
language through its spoken form. One should first understand the 
spoken language and then be able to speak it. A child should first be 
allowed to develop his listening and speaking skills before he 1s 
introduced to writing skills. This is not the method that we follow in our 
schools, and even when the learners get a chance to listen, it is not the 
acceptable variety of the language that is introduced to them. As a 
result, the students are often exposed to deviant phonological errors. 


In the learning of a Second Language, the habits already 
acquired in his L1 acquisition do have considerable influence. Each 
language is a different system with its own structure, and in foreign 
language learning, there is a tendency to learn and speak on the basis of 
one’s own mother-tongue. The influence of the source language on 
target language poses a major problem for learners to pick up an 
acceptable pronunciation of English. 


In the English spoken by Malayalam speakers, the phonetic 
interference of the source language is felt so much that their English is 
hardly intelligible to the speakers of other Indian languages, Jet alone 
native speakers. Differences in mother-tongue and the background of 


. 


sf 
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speakers create different speech patterns, which often lead to 
unintelligibility. It is therefore essential for a second-language learner 
to understand the distinct phonetic features of the target language, and 
to find out its difference from the phonetic features of his 
mother-tongue. This will help the second-language learner to improve 
his pronunciation and make his English more intelligible. 


The present study has attempted to analyse the phonological 
deviations in the pronunciation of the vowel phonemes by Malayali 
speakers. The study aims at an investigation of the extent to which the 
English vowel speech sounds used by Malayali speakers diverge from 
the standard variety of English but only the suprasegmental features 
have been listed here. It should also be noted that the suprasegmental 
features of these speech sounds, likely to show further variations, are 
not included in the purview of this study. The informants are ten 
students selected at random from the Medical College High School, 
Thiruvananthapuram. The data for the study consists mainly of 
individual words. 


The recorded data from the voice recorder was transferred to 
the computer. The recording of each informant was listened to carefully 
to observe the phonetic and phonological features of the speakers. The 
features of the pronunciation of each vowel phoneme were noted and 
compared to the pronunciation of these words by native speakers. The 
native model speech (RP) has been taken from the compact disc 
provided along with the Oxford Learner’s Dictionary. Divergent to the 
English speech sounds spoken by the informants from the RP were 
noted. 


A comparison of the vowels of English and Malayalam makes it 
clear that certain vowel phonemes of Malayalam are quite similar to 
that of English. The informants do not find it difficult to pronounce 
these sounds since similar ones occur in their own mother-tongue but 
out of the twelve pure vowels of RP, only seven have similar sounds in 
Malayalam. 


The phonemic inventory of English has seven short vowels and 
five long vowels. They are: 


4. /%/ asinit / It / 


‘ O fels as in elephant / elifont / 


ijdl Vol. XXXVII No. 1 219 Rajalaskhmi 


Dud ce J as in apple / epl/ 
4. /0/ as In OX / Oks / 
= Ae a Ua as in pull /pul/ 
6. /A/ as In us / As/ 
‘fs Wau as in apply / dplai / 
8.507 F / as in east /1:st / 
o/ Prue as in fool / fu: / 
Ly Ae a as in Car / ka: / 
iB pa fe Be as in water /w:td/ 
ee as in bird FA Se We oF 


immo Ai ao th eulf/ Ui, / ul /e/ are sounds 
familiar to Malayali speakers since they occur in the Malayalam 
phonemic inventory. Students do not have much difficulty in producing 
these sounds because of the similarity but there are exceptions where 
the mother-tongue influence is strong when students often deviate to 
Malayalam substitutes. 


Observations have been made from the data collected from the 
informants and the following divergences from the RP have been noted. 
The following are the English vowels which have their analogues in 
Malayalam. Hence deviations from RP are rare. The vowel phonemes 
with examples in the word initial, medial and final positions are given 
below along with the description and examples of their analogues in’ 
Malayalam. 


1./A/ 
Word initially 
E.g. us  /As/ / As / no change in the vowel / A / 
up /Ap/ /Ap/ no change in the vowel / A / 
Word medially 
bus / bAs / / bes / here / A/ 1s replaced by / ¢ / 


butter /bAtd/ /bAtd/ no change in the vowel / A / 
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/ A / does not occur in the final position. Though this sound 
exists in Malayalam, the Malayalam speaker at times pronounces the 
/ A/ sound by substituting it with / e /. 


/a/in Malayalam is open back unrounded short vowel. It occurs 
in the word initial, medial and final position. 


E.g. amma ‘mother’ /kati/ ‘bite’ /kata/ ‘shop’ 

2.42: / 

Word initial 

Eg. arm /d:m/ / a:rm / no change in the vowel / a: / 
art 3 /a:t/ / a:rt / no change in the vowel / a: / 

Medial 


master /mda:sto/ /ma:stodr/ no change in the vowel / a: / 


Final 
car / ka: / / ka:r / no change in the vowel / a: / 
bar / ba: / /ba:r / no change in the vowel / @: / 


/ a: / back open unrounded long vowel in Malayalam. / a: / occurs 
in the initial, medial and final position. 


Eg. /a:na/‘elephant? /ka:td/‘ear’ /ta:/ ‘give’ 


3./1/ 

Word initial 
it /It / /It/ no change in the vowel / 1 / 
is /1Z/ /1S/ no change in the vowel / 1/ 


Some speakers have the tendency to lengthen the |1| sound. 
Word medial 


> sit. / sit / sit / no change in the vowel / 1/ 
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Word final 
city /sliti/ / siti / no change in the vowel / I / 
early /3:li/ / e(r) lt / no change in the vowel /1 / 


/i/ im Malayalam is front close unrounded vowel, occurs initially, 
medially and finally. 


E.g. /1ina/ ‘mate’ /ciri/ ‘laugh’ = / e:ni/ ‘ladder’ 


4./1:/ 
east = /1:st / /1:st / no change in the vowel / 1: / 
eat /1:t / /1:t / no change in the vowel / 1: / 
meet /mui:t/ / mist / no change in the vowel / i: / 
pea /pi:/ / pi: / no change in the vowel / 1: / 
key /ki:/ / ka: / no change in the vowel /1:/ 


There is no change in the vowel / 1: / but / 1: / is lengthened to the 
maximum in words like ‘east’, ‘eat’ and ‘meet’. 


/1: / is front close unrounded vowel in Malayalam, occurs in the 
initial, medial and final positions of words. 


E.g. /i:nam/‘tune’ /citti/‘cotton’ /ti:/ ‘fire’ 


5./u/ 


It does not occur in word initial position. 


In the medial position: 
pull /pul/ /pyl/ no change in the vowel / -v/ 
wood /wvyd/ /wd/ no change in the vowel / */ 
put /put/ /put/ no clase in the. vowel /v7/ 


should / fud/ /fud/ no change in the vowel a 
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In the final position: 
/ju/ 


you /jw/ 


Rajalaskhmi 


no change in the vowel /1/ 


/u/ back close rounded short vowel in Malayalam, occurs in the 


word initial, medial and final positions. 


E.g. /uppd/ ‘salt’ 


Ste 
ooze /u:z/ /u:z/ 
food / fu:d’/ /fud/ 


bamboo/ bembu:/ /bzmbu: / “ 


do / du: / {dtr / 


/ putava / ‘dress, clothing’ 


/ po:ku / ‘go’ 


no change in the vowel / u: / 


4 
long vowel / u: / is represented 
by short vowel /U / 


no change in the vowel / u: / 


long vowel / u: / is represented 
by short vowel /-u- / 


/ u: / back close rounded vowel in Malayalam, occurs in the word 


initial, medial and final positions. 


E.g.  /u:ma / ‘dumb’ 


fel 
Word initially 
elephant /elifont/ /elifdnt / 
elder /eldd/ /eldo(r) / 
Word medially 
met / met / / met / 
head / hed / / hed / 
melt darieit | —- aoke / melt / 


/ ku:ti/ joined’ 


/ pu: / ‘flower’ 


no change in the vowel / e / 


no change in the vowel / e / 


no change in the vowel / e / 
no change in the vowel / e / 


no change in the vowel / e / 


. ,& /e/in Malagaiciat is front unrounded vowel between half close 
and half open, occurs easel, medially and finally. 
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E.g. /evite/‘where’ /tekkd/ ‘south’ / puRame / ‘outside’ 


There are a few pure vowels which are absent in Malayalam. 
They are: 


Ls ee / ket / ‘cat’ 
2./0/ /ppt/ ‘pot’. 
Bi tates Peaen | ‘corn’ 
4./0/ / dplai / ‘apply’ 
ey /b3:d/ ‘bird’ 


The native speaker of Malayalam finds it difficult to articulate 


sounds that are unfamiliar to him. He therefore substitutes the sounds . . 


of the target language with the sounds of the mother-tongue. The 
following examples show how the students substitute the sounds of RP 
with the sounds of their mother-tongue. 


of i. 
It is difficult for the native speakers of Malayalam to 
articulate / z /. Hence a native speaker tends to replace / 2 / with / a: / 


and / A/. 


ant / ent / / a:nt / / 2 / is represented by the long 
vowel / a: / of Malayalam 


apple /zpl/ /a:pl / / 2 / is represented by the long 
vowel / a: / of Malayalam 


man /men/ /ma:n / / z / is represented by the long 
vowel / a: / of Malayalam 


cat / ket / / kjAt / / 2 / is represented by/ A / 
/ 2 / does not occur word finally. 
2./D/ 


There is a tendency to articulate I’ / by substituting it pith the 
Malayalam / 0 / and/ o: /. 
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Ox /Dks / / oks / /'D/ is replaced by the vowel 
/o/ of Malayalam 


oxygen /Oksidz dn/ /oksid% dn/  / P/is replaced by the 
vowel / o / of Malayalam 


hot /hvt/ / host / /Y/is replaced by the vowel 
/ o: / of Malayalam 


college /kw hd / /ko:le:dz/ - /9/1s replaced by the vowel 
/o/ of Malayalam 


/90/ does not occur in the final position in words. 
BAD; / 


The long vowel / 3: / of English is often represented by Malayali 
speakers with / a: / and / o: /. 


all /D:1/ /a:il/ /3:/is replaced by the vowel 
/a:/ of Malayalam 


water /w3:td/ /wa:stdr/ / 3: / is replaced by the vowel 
/a:/ of Malayalam 


corn /k23:n/ /ko:n/ / 3: /1s replaced by the vowel 
/ o: / of Malayalam 


paw /p3:/ / po: / />:/is replaced by the vowel 
/ o: / of Malayalam 


door /d92:/ / do: / /2>:/1s replaced by the vowel 
/ o: / of Malayalam 


4./0/ 


The / 0 / sound of English is totally confusing to a Malayali 


} apply "| fd DRL te ‘Aplai / /0/1s replaced by the vowel 
fs | / A/ of Malayalam 
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appoint /dpvint/ /Appint/  /@/is replaced by the vowel 
/ A/ of Malayalam 


stubborn /stAbdn/ /stAbo:n/ — /@/is replaced by the vowel 
/ 0: / of Malayalam 


father /fa:%0/ / fa:tdr / no change in/ 0 / 


mother /mAB9/ /mAtor / no change in/ 0 / 


Sahih = 7 
The / 3: / sound is absent in Malayalam and divergences are 

rare. 

Earl /3:1/ /3:1/ no change 

Earth /3:0/ /e:(r)0 / / 3:/1s replaced by the vowel 

/ e:/ of Malayalam 
bird /b3:d/ /b3:d/ no change 
shirt /f 3:t/ / fArt/ / 3:/1s replaced by the vowel 


/ A/ of Malayalam 


prefer /prif3:/  /prifA(r)/  /3:/is replaced by the vowel 
/ A/ of Malayalam 


slur /sla:/ / slA(r) / / 3:/is replaced by the vowel 
/ A/ of Malayalam 


From the above data, it can be seen that there are deviations in 
the pronunciation of English vowel phonemes by Malayali speakers. 
The causes for these divergences can be attributed to a number of 
reasons. The absence of certain vowel phonemes in the phonemic 
system of the source language (Malayalam) is one of them. For 
example, the / 9: / of RP which does not exist in Malayalam is replaced 
by / a: / and / o: /. The vowel / e / is replaced by / a: /, and /D / is 
replaced by the / 0 / of Malayalam. / 0 / is replaced by / A / ang MS: / is 
replaced by./ ¢: / and/ A / of Malayalam. 
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It can be observed that the influence of the vowel system of 
Malayalam (the source language) on that of English (the target © 
language) has caused the deviations. When the second-language learner 
begins learning English, he has already acquired a system of 
communication which controls his responses to whatever he learns at 
later stages. He can learn new sounds only in terms of the system of 
communication in which he is proficient. Most of the learners are found 
substituting many sounds of the target language with the sounds of the 
mother-tongue. Malayalam is a Dravidian language which has a sound 
system and syntactic system different from that of English which 
belongs to the Indo-European family of languages. A Malayali speaker 
who learns English as a second language will definitely face difficulties 
since the new phonology of L2 is built up by the learner using L1 
phonology as a base. 


Today there are many varieties of English but RP English 
continues to be looked upon by many as the standard. Those concerned 
with the correctness of the language fear that English is "becoming less 
and less English ...." (20) (Kachru & Nelson) and they have also voiced 
the fear that the:varieties of English will soon become mutually 
unintelligible and before long will be undeserving of the title of 
‘English’. Quirk in The English Language in a Global Context says that 
all English-speaking or using countries should accede to "a form of 
English that is both understood and respected in every corner of the 
globe where any knowledge of any variety of English exists" (6). 


In this context, it would be advisable to introduce new 
techniques at the school level to enable students to listen to models of 
native speech. The curriculum needs to be so restructured that equal 
importance is given to all the four language skills while teaching 
English, especially at the elementary level. This calls for greater 
orientation of policy-makers and the teaching community to achieve the 
goal of neutralizing the deleterious influence of L1 in the Malayali 
speakers of English. 
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RUQPUCQDOQOSEUEEOEOQGGCROREEEERDESOCEECERSSCCSSRREOS EOE E EERE 
~ A LINGUIST VISITS BANGLADESH : 
= Sanghamitra Saha, 2001, PB, Demy 1/8, pp. iv+96, Rs. 75/- (US$ 5/-) ‘ 
=: A most interesting travelogue of a linguist whose iP 
=: parents came from Bangladesh and several relatives still live ‘= 
=| there. It reads like a detective novel. ‘= 
=| . BRAIN AND LANGUAGE (Seminar Proceedings) ‘= 
rt PA. Suresh, Annie Monsy & S. Maya (Eds. 1994, HB, Demy 1/8, : 
=| pp. xiv+214, Rs. 275/- (US$ 75)-) : 
= A compilation of 7 papers on various aspects related ‘= 


=, to the connection between brain and language. The papers «= 
=, deal in depth with the study on how brain damage can «= 
‘ms: disrupt the use and system of language functioning in «= 
=, children as well as in adults and looks at unique i= 
=| Opportunities to find out more on the anatomo- is 
=, physiological organization of the human brain, and in is 
=, identifying the physiological components in the mental is 


= processing of language. z 
> CALDWELL AND A.R. RAJA RAJA VARMA : 
=: ON MALAYALAM GRAMMAR a 
“| K. Raghavan Pillai, 1996, Demy 1/8, pp. 168, Rs. 250/- (US$ 25/-) - 
= A comparative study of the views of two prominent {- 


=, grammarians, this work attempts to compare the views of ‘= 
=, Caldwell and Rajaraja Varma (A.R.) on Malayalam, its {5 


= relation with Dravidian in general and Tamil in particular. f 
=, The work aims to analyze critically the introduction (Pitika) {5 


=| by A.R. to Keralapaniniyam, vis-a-vis an evaluation of {5 
=! Caldwell’s views on Malayalam grammar and its !5 


=: development. The discerning critic will find this work ‘ 
=| informative and stimulating. im 
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Review 


DANCE OF DIVINE LOVE - THE RASA LILA OF 
KRISHNA FROM THE BHAGAVAT PURANA, INDIA’S 
CLASSIC SACRED LOVE STORY. Schweig, Graham 
M. 2007. First Indian Edition. Delhi: Banarsidass. 389 + 
Xxxi pages. Rs. 650/-. 


Reviewed by 
T. MADHAVA MENON 


Bhagavata Purana (hereinafter BhP) is probably the most 
popular text revered as holy in many Hindu households. It has been 
translated into most Indian languages - in Malayalam alone, there are 
several - and many of these translations come with voluminous notes 
and explanations. Rasa Lila is in the 10th Book (Dasamaskandham), 
which describes Vishnu’s 8th incarnation as Sri Krishna. For various 
reasons, Krishna has been the best-loved Hindu deity, probably because 
He is the most susceptible to human reactions like indulgence in sexual 
love, intrigue and military prowess. His early life and dalliance with 
innumerable women has been subject matter of much literature. He is 
the centre of the Bhakti Cult that supplanted Vedic Hinduism and 
highly philosophical schools like Advaita, etc. Charismatic saint-leader 
Chaitanya Mahaprabhu, himself revered and worshipped as 
embodiment of Sri Krishna, popularised the cult all over India, 
especially Bengal and Orissa. The cult produced great literature - Gita 
Govindam (Jayadeva) and Sriknshnakamamrtam (Vilwamangalam or 
Lilasuka) in Sanskrit, and many works of great merit in other Indian 
languages. It also attracted European and other foreign scholars, both 
because of the excellence of the creative expressions, as well as the 
philosophical and historical significance of the movements that the Cult 
gave rise to. 3 kt 
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Schweig attempts, in book under review, deep study of five 
chapters containing Rasa Lila. He furnishes translation of the original 
into a sort of prose with lines of unequal length and randomly 
distributed capital letters, which, though easy to understand, does not 
give any indication of the musical diction and sonorous magnificence of 
the original. He furnishes the original text in Devanagari script with 
English transcription at the end of the book. The original Sanskrit is 
untranslatable into a language as alien as English. Schweig attempts to 
bring out multiple meanings to which the text can be interpreted (S/ésa) 
by laboriously explaining them in expansive notes. He proceeds to 
"illuminate" the text with explanations; supplemented with "Notes and 
Comments’". The Sanskrit Text, more notes, glossary, bibliography and 
index add to the comprehensiveness of the treatment. The book reads 
like a very competently edited Ph.D. dissertation. 


The author leans exclusively on writings of other Occidental 
scholars, and literature produced by Chaitanya school of Bengal. He 
dismisses the earlier devotional cults of Saiva Siddhanta in a brief 
footnote: 


"In this matter, Rudolf Otto states, "the special problems 
of the doctrine of grace have been developed more acutely and 
in greater detail among the Vishnu-bhaktas than among those of 
Siva" ... In spite of the evolved forms of bhakti in the Saivite 
tradition, especially in the Saiva Siddhanta of south India, an 
ultimate veil of monism, in my estimation, seems to eclipse a fully 
developed bhakti theism of the sort found in Vaishna sects" 
(page 107). 


This tacit assumption that dualism is diagnostic of Bhakti cults is 
not entirely defensible. Even in Dasamaskandham, many verses admit 
monistic interpretation. The next, viz. Ekadasam, contains explicitly 
monistic assertions. Bhagavad Gita supports several monistic 
interpretations. Sankaracharya used it as a main authority to support 
his theory of Absolute Monism, with maya as the indefinable 
(anirvacaniya) factor hiding the true nature of Ultimate Reality. 
Vaishnavite Saint Ramanuja "modified" Sankara’s "General" theory into 

a "Special" one, Visistadvaita, in which God is simultaneously Immanent 
and Transcendent leads to praxis of saranagati (total surrender to the 
lord). As will be shown below, Schweig’s ignoring of these aspects has 
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led him into conceptual incompleteness in the treatment of maya, as 
well as the ethical implications of Sri Krishna’s amorous exploits. 


Though the original is not presented as a "Drama", Schweig 
prefers to treat it as such - it satisfies all technical definitions of a 
Sanskrit Drama. He explains the three different layers of narration in 
which the text is presented, viz., prologue, narrator and actors. It is 
narrated by Suta in Naimsharanyam to assembled sages. He reports and 
recounts how Sri Suka, illustrious and celibate son of Sri Vyasa, "editor" 
of Vedas, narrated the story of Sri Krishna as an avatar of Vishnu to 
Parikshit, awaiting death. The reportage includes dialogue between 
Suka and Parikshit, as interludes in the narration of the main story. 
Schweig has presented the latter as preludes and intermissions, the 
main “action” having been provided by Krishna and the "female 
interest", viz., Gopika-s (cowherdesses). The story is well known, and 
need not be repeated here, though it forms the bulk of the book under 
review. 


Yogamaya: I find the opening stanza the key to the entire discourse: 
Schweig translates it, adequately enough: 


"The illustrious son of Vyasa spoke: 


‘Even the Beloved Lord Turned his mind toward 
seeing those nights Love’s delights, 

in autumn filled with fully taking refuge in 
blooming jasmine flowers, Yogamaya’s illusive powers" 


This is Suka’s version of what happened. As a realized soul, he is 
not merely reporting what contemporaries of the actions concerned 
witnessed, but adds words, short but highly significant, to explain the 
deeper reality that Parikshit, his audience, should perceive. The most 
important part of the key, in turn, is the last line, yogamayamupasntah. 


Schweig translates upasritah as “fully taking refuge in". He cites 
Monier Williams classic Dictionary. But why he should have chosen this 
out of the many possible meanings listed in the great Dictionary is not 
clear. He cites Sridhara, a famous commentator, as authority for 
interpreting (correctly) wpa in the sense of adhikya, intensity, rather 
than "completeness", but prefers the latter. The word asraya is from 
root sray, to affix, and a@srayate would mean "to attach onéself to, to 


ijdl Vol. XX XVII No. 1 232 Menon 


join". Monier Williams glosses the noun form as a "recipient, the person 
or thing in which any quality or article is inherent, or retained, or 
received". It can be interpreted as "depending on, having recourse to, a 
plea, excuse”. 


The entire corpus of Vaishnava literature, including even later 
recitals in the same text, proves that it is not the Lord who is dependent 
on, or having to take refuge in, maya, but the other way around, e.g., in 
verse 5.38, Schweig’s translation "deluded by his (Krishna’s) power of 
maya". Schweig himself cites several quotes from Bhagavad Gita, which 
clearly indicate that it is the Lord who uses maya, to hide His true 
nature and to reveal it to those who please Him. "Repeatedly 
throughout the Bhagavata text, the reader is reminded that Maya is a 
power that belongs to God" (page 133). "Yoga Maya is dependent on 
God, originates in God, and cannot exist without God or act without his 
sanction" (page 134). But, rather inconsistently, Schweig concludes: 


"Yet it is clear that Maya is an independent potency, 
completely separate from his own personal manifestation. If this 
were not the case, Krishna would not be able to submit himself 
to it, as he does in the first verse of Rasa Lila" (ibid.). 


This refusal to concede the totally subsidiary character of maya, 
which is but a property of the Lord, has resulted in the misleading 
translation of this verse, and much of the theorisation that it entails. In 
fact, as will be shown, Krishna does not submit himself to any person or 
emotion in Rasa Lila, but makes the other participants submit to His 
illusory indulgences, as means taking them to greater glory. 


Schweig (correctly) points out; Krishna is Yogeswara, master of 
yogic power. At page 133, he cites a verse (3.7.9) asserting that the Lord 
can never be subjected to Maya. The Lord makes Maya delude the 
husbands of Gopi-s to believe each was at home with the respective 
husband. Schweig has no reason to make a special plea that His 
behaviour prior to Rasa Lila was "exceptional and unexpected". Several 
Female Deities in popular Hinduism, including Mother-Goddesses of 
"common" people, have been called, among other names, "Maya". They 
exercise power in their own right. But adherents of Vaishnava cults 
believe that they are ali subject to and subordinate to their Lord. 
Schweig’s arguments to the contrary are not consistent with the 
cosmology or,.the ontology of Bhagavata Puranam, or of the vast 
literature of various schools of Vedanta, including Dwaita. 
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The great dilemma: Parikshit asks Suka how Krishna could have blessed 
Gopi-s whose understanding of Him was only as a Lover. The answer is 
theological key to Vaishnavite concept of liberation. What is essential is 
the totality of surrender to Him, and not of how His ultimate Nature 
has been apprehended by the devotee. Suka cites the example of 
Sisupala, king of Chedi, who hated Krishna with all his heart and soul. 
This total absorption, though negative, in thought and meditation, 
sufficed for Krishna to "liberate" him, though by cutting off his head! 
Kamsa feared the Lord all the time. Krishna "liberated" him, 
pummelling him to death with his bare hands. Gopi-s loved Him 
carnally with their entire being, and so He ‘liberated’ them by indulging 
their desire physically. 


Parikshit later on asks how the Lord, whose incarnation is for 
protecting Dharma, could commit adultery. The Lord, because He is 
Supreme, has no ‘desires’ unfulfilled; so he could not desire physical 
charms and sexual indulgence. Suka answers: the Most Exalted are 
beyond ethical imperatives that bind only the common herd. Siva drank 
poison that would have destroyed everything else. Schweig translates: 


"Words from exalted souls, 

meant for us, are always true, 

though their actions 

only sometimes can be followed" - (page 74) 


Later authors sought to solve this dilemma in various ways. A 
verse in Devi Bhagavatam explains that when Nara and Narayana were 
out to kill off assorted demons, they could tackle one only by beguiling 
him with a beautiful woman. So, they created divine courtesan Urvasi 
from their thigh. She immediately sought them as her paramours; they 
put her off with the promise that she and her companions would have 
their full of pleasure during their forthcoming incarnation as Krishna. 
So, it was not "really" women with whom Krishna disported, but divine 
courtesans disguised as such. 


"Mythic Stratigraphy": Obeyesekere G., in The Cult of the Goddess 
Pattini (Motilal Banarsidass, Delhi, 1987, pages 283 to 317) points out 
that there are several strata or layers of meaning embedded in every 
myth. There are many indications in BhP that shows such stratification. 


Stratification of the narrative: The narrative is in at least three levels. 
What "happened" is what Suka relates to Parikshit; .but then, there is 
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another level, when Suka and Parikshit discourse on the significance of 
what "happened". This is integrated into yet another level, when the 
Suta re-relates the entire myth to the assembled sages at 
Naimsharanyam, after Parikshit’s son has performed the Sarpasatra 
sacrifice. 


Levels of "Reality": The unique character of most Hindu Purana-s is that 
"Reality" also is multi-layered. In the Rasa Lila, "Reality" experienced 
by the husbands of Gopi-s is that that they were all cosily at home, 
fulfilling "wifely duties". They were "deluded" by Maya. Incidentally, this 
device of the delusion of the husbands is not seen in Harivamsam. 
"Reality" experienced by Gopi-s in turn consisted of vicissitudes of their 
affair with Krishna. He was chatting pleasantly with them and they were 
proud; so he disappeared, and went off with just one of them, whom 
also he deserted when she in turn got proud. They grieved and humbled 
themselves; so he reappeared and made passionate love with them. But 
to Suka and Parikshit, "Reality" consisted in the ways in which devotees 
approached and God reciprocated in kind. And of course, 
ontologically, the entire episode, as indeed the entire physical Universe, 
is but the Maya of the Lord. 


An expertly crafted Literary artefact: If we pierce through the halo of 
holiness cast on this particular piece of literature, we can see that it was 
an expertly crafted poem, in the best traditions of Sanskrit poesy. Great 
poems (mahakavya) must exhibit variety of meter, profusion of literary 
ornamentation (alamkara) including S/esa, descriptions of forests and 
cities, and orgies of sexual love. Valmiki’s Ramayana describes, in 
Sundarakandam, bedroom scenes of Rakshasi-s. Magha describes 
orgies on Raivataka Mountain. Kalidasa details the nuptial night of Siva 
and Parvati. This requirement is met in the BhP through Rasa Lila. 


The "message" is the "massage": This famous dictum of the now-defunct 
Management Guru McLuhan was anticipated by Bhakti literature 
millennia ago. Unlike Veda, Puranas were meant to be disseminated 
among as wide an audience as possible. The aim was to get more 
people, including those of the "lower" castes, and "even" women, to 
become adherent to it. The means were to appeal to each level of the 
people in a way that would be most appealing to each. 


"Soft Porn", coupled with bloody violence, then as now, is the 
most powerful attraction for’most people. By making sex the favourite 
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indulgence of Gods themselves, and preferably with human females 
under worldly circumstances, such as when soft autumn breezes wafted 
the sweet perfume of budding jasmine flowers under a full moon, the 
narrative fascinates common people, especially women. Even today, 
reading Bhagavata Puranam in seven days (saptaham) is increasingly 
popular throughout Kerala. 


The literature contained a higher level of discourse, covering 
ethics, and proclaiming the greatness of personal deities, especially 
Krishna. It foregrounded devotion to Brahmins (brahmana bhakti) as of 
equal if not greater importance than devotion to God Himself - after 
all, one of the pillars on which the greatness of Vishnu rested was His 
(reportedly) exceptional devotion to this community. Local chieftains 
were exhorted to follow in the footsteps of the Lord, making largesse to 
Brahmins, and worshipping them as substitutes for the Lord. Buddhist 
and the Sankara Vedanta monastic traditions had resulted in the great 
respect that their practitioners commanded. It was necessary to storm 
this citadel with high philosophy and sophisticated ontological and 
cosmological arguments. Such disputation continues to this day, 
because the conundrums of the Infinite can never be "Completely’ 
solved. 


Summary: In sum, Schweig has meticulously analyzed and explained the 
verses and the obvious significance of Rasa Lila chapters of BhP. He 
has chosen to limit his horizon by wearing as blinkers the work of 
Occidental scholars and of the Bengali school, to the exclusion of South 
Indian literature. He loses perspective by a wrong appreciation of the 
Lord-Yogamaya relationship. By choosing to interpret the Lord as 
totally depending on His own Maya, he loses the opportunity of 
detecting the different levels of "Reality" represented by the different 
levels of the narration. In other words, he has carefully and accurately 
counted all the trees, but missed the forest. 


However, he deserves congratulation for the dedication that he 
has brought to this great work. The treatment is comprehensive, and 
elaborate. Well worth reading, carefully, but not with "devotion". 
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In addition, I have consulted many Malayalam translations, 
including Thunchat Ezhuthachan’s classic work. The notes and 
explanations in most of them have actually given me the information 
required to appreciate the beauty of this great work of literature. 


I thank Doctor M.G. Sashibhushan, with whom I had a brief 
discussion on this subject. The responsibility for the ideas expressed, 
however, is of course my own. 


4 KERALA PAANINIIYAM OF A.R. RAJA RAJA VARMA 2 


=: C.J. Roy (Tr.), 1999, HB, Demy 1/8, pp. xxviii +332, Rs. 400/- (US$ 40/-) ‘= 


= The original work is a classic in Malayalam E 


=: grammatical literature. In the late 19th and early 20th «= 
=, centuries, there were no compeers to A.R. in any of the i= 
=, South Indian languages. A.R. had an analytical mind. ‘= 
=| Though well-versed in Paniniyam, he did not follow it ts 
=; blindly. Wherever Paniniyam was not found applicable, he ‘= 


=; took an independent course of analysis. ‘= 
=: The English translation enhances the merit of the ‘= 


=: original, providing an unambiguous understanding of it with ‘3 
=: maximum accuracy. An excellent translation of a difficult = 


=: but most valuable text. = 
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OBITUARY OF WINFRED P. LEHMANN (1916 - 2007) 
H.S. ANANTHANARAYANA 


Professor Winfred P. Lehmann, an internationally reputed 
authority on Indo-European linguistics, died on 1st August 2007 in 
- Austin, Texas (U.S.A.). He served for more than five decades at the 
University of Texas at Austin in various capacities. Before joining the 
University of Texas in 1949, he had worked for three years in 
Washington University (1946-49) and for four years in Army Signal 
Corps (1942-46) as Instructor in Japanese. 


Born on 23rd June 1916 in Surprise, Nebraska, Professor 
Lehmann graduated from University of Wisconsin, receiving his M.A. 
(1938) and Ph.D. (1941) in Germanic linguistics. He married Ruth 
Preston who was in her own right an authority in Old Irish poetry and 
taught in the Department of English at the University of Texas. The 
couple is survived by a son (a Professor of History) and a daughter. 


Professor Lehmann was solely responsible for founding the 
Department of Linguistics at the University of Texas of which he was 
Chairman from 1944 to 1972. He attracted many young and promising 
linguists to the Department and developed it to become number two in 
the U.S.A. He directed about 50 doctoral dissertations, one of which 
was on ‘Vedic Intonation’ by H.S. Ananthanarayana. 


Professor Lehmann occupied the Chair of Louann and Larry 
Temple Professor of Linguistics from 1938 to 1986 and continued after 
retirement as Professor Emeritus in Humanities till the end 
(1986-2007). He was also Director of Linguistic Research Centre 
(1961-2007) which has put out courses online for early Indo-European 
languages, and Founder-Editor of Machine Translation (1984-86). 


Professor Lehmann received several national and international 
awards including Doctor of Humane Letters from University of 
Wisconsin (1995), Doctor of Letters from State te University of New York 
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(1985), Collitz Professor of Linguistics at Linguistic Institute in Illinois 
(1968), Guggenheim Fellowship (1972-73) and Harry Huntt Ransom 
Award for Teaching Excellence in Liberal Arts from University of 
Texas (1983). 


He visited many countries which honoured him. He was 
Chairman, Linguistic Delegation to People’s Republic of China, was 
Guest of the Indian Council for Cultural Relations (with lectures in 
Universities and Research Institutes) in 1978, and was awarded Order 
of Merit of the Federal Republic of Germany (Commander’s Cross) in 
1987. In 1981, he was invited to deliver Nehru Memorial Lecture in 
New Delhi, which is a rare distinction conferred on a foreign scholar. 


Professor Lehmann served with distinction im various 
professional societies. He was President of Linguistic Society of 
America (1973) and President of Modern Languages Association of 
America (1987). He was a life-member of American Oriental Society, 
Linguistic Society of India ‘and Dravidian Linguistics Association. 


He has authored many books and an innumerable number of 
articles. He extended the horizon of typological studies in many of his 
writings. His major publications include: Proto Indo-European 
Phonology (1952), Reader in 19th Century Historical Indo-European 
Linguistics (1967), Proto Indo-European Syntax in Generative 
Framework (1974), Historical Linguistics Third Edition (1992) and Pre 
Indo-European (2002). 


One of the projects which was uppermost on his mind during his 
last years was a programme called Hominid, the aim of which was to 
put together information on the early stages of humans. It had to do 
with the early language and languages centered on the Nostratic group 
which meant Indo-European, Afro-Asiatic, Ugric, Altaic, Kartvelian 
and Dravidian. For each of these groups, the aim was to produce a 
grammar of their earliest stage, based on which he had planned to 
reconstruct proto-human. Professor Lehmann was greatly interested in 
India and its people, and visited the country on several occasions. He 
trained several Indian scholars who continue his tradition. He always 
took care of his students and played a great host to them while they 
were away from their own. countries. In his death, the linguistic 
community has lost a great champion of Indo-European studies and a 
master in Historical Linguistics. : 
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